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Abstract. The probability Psucccss(a, A'^) that stochastic greedy algorithms successfully solve the random 
SATisfiability problem is studied as a function of the ratio a of constraints per variable and the number 
of variables. These algorithms assign variables according to the unit-propagation (UP) rule in presence 
of constraints involving a unique variable (1-clauses), to some heuristic (H) prescription otherwise. In 
the infinite A'^ limit, Psucccss vanishes at some critical ratio an which depends on the heuristic H. We 
show that the critical behaviour is determined by the UP rule only. In the case where only constraints 
with 2 and 3 variables are present, we give the phase diagram and identify two universality classes: the 
power law class, where Psuccess[Q;if(l -I- eA'^"'^''^), A'^] ~ A{e)IN'^\ the stretched exponential class, where 
fsucccss[aH(l -I- tN~^l^),N\ ~ exp[— A'^'^'^® ^(e)]. Which class is selected depends on the characteristic 
parameters of input data. The critical exponent 7 is universal and calculated; the scaling functions A and 
$ weakly depend on the heuristic H and are obtained from the solutions of reaction-diffusion equations for 
1-clauses. Computation of some non-universal corrections allows us to match numerical results with good 
precision. The critical behaviour for constraints with > 3 variables is given. Our results are interpreted 
in terms of dynamical graph percolation and we argue that they should apply to more general situations 
where UP is used. 

PACS. 89.20.Ff Computer science and technology - 05. 40. -a Fluctuation phenomena, random processes, 
noise, and Brownian motion - 02.50.Ey Stochastic processes - 89. 75. Da Systems obeying scaling laws 



1 Introduction 

Many computational problems rooted in practical appli- 
cations or issued from theoretical considerations are con- 
sidered to be very difScult in that all exact algorithms 
designed so far have (worst-case) running times growing 
exponentially with the size N of the input. To tackle 
such problems one is thus enticed to look for random- 
ized (stochastic) polynomial-time algorithms, guaranteed 
to run fast but not to find a solution. The key estimator of 
the performances of these search procedures is the proba- 
bility of success, Psuccess(A^), that is, the probability over 
the stochastic choices carried out by the algorithm that a 
solution is found (when it exists) in polynomial time, say, 
less than N . Roughly speaking, depending on the problem 
to be solved and the nature of the input, two cases may 
arise in the large N limit ^ 

p — l ^(1) (success case) . x 

-^succoss(^V) - I gxp(_g)(^)) (failure case). 

When Psucccss is bounded from below by a strictly positive 
number at large iV, a few runs will be sufficient to provide 
a solution or a (probabilistic) proof of absence of solution; 
this case is referred to as success case in the following. Un- 
fortunately, in many cases, Psucccss is exponentially small 
in the system's size and an exponentially large number of 
runs is necessary to reach a conclusion about the existence 

* Preprint LPTENS-05/24. 

^ Recall that f{x) = 0{x) means that there exist three 
strictly positive real numbers X,a,b such that, Va; > X, 
ax < f{x) < bx. 



or not of a solution; this situation is hereafter denoted by 
failure case. 

An example is provided by the K-SAT problem, infor- 
mally defined as follows. An instance of K-SAT is a set 
of constraints (called clauses) over a set of Boolean vari- 
ables. Each constraint is the logical OR of K variables or 
their negations. Solving an instance means either finding 
an assignment of the variables that satisfies all clauses, or 
proving that no such assignment exists. While 2-SAT can 
be solved in a time linear in the instance size T , K-SAT 
with i^T > 3 is NP-complete (2,. The worst-case running 
time may be quite different from the running time once 
averaged over some distribution of inputs. A simple and 
theoretically-motivated distribution consists in choosing, 
independently for each clause, a iiT-uplet of distinct vari- 
ables, and negate each of them with probability one half. 
The K-SAT problem, together with the flat distribution 
of instances is called random K-SAT. 

In the past twenty years, various randomized algo- 
rithms were designed to solve the random 3-SAT problem 
in linear time with a strictly positive probability Psuccess P] 
when the numbers M of clauses and N of variables tend 
to infinity at a fixed ratio ^ a = M/N . These algorithms 
are based on the recognition of the special role played 
by clauses involving a unique variable, called unit-clauses 
(unit-clauses are initially absent from the instance but 
may result from the operation of the algorithm) , and obey 
the following 



^ Other scalings for M, N were investigated too and appear 
to be easier to handle, see reference 



2 



C. Deroulers and R. Monasson: Criticality and Universality in the Unit-Propagation Search Rule. 



Unit-Propagation (UP) rule: When the instance 

INCLUDES (at least) A UNIT-CLAUSE, SATISFY THIS 
CLAUSE BY ASSIGNING ITS VARIABLE APPROPRIATELY, 
PRIOR TO ANY OTHER ACTION. 

UP merely asserts that drawing obvious logical conse- 
quences from the constraints is better done first. Its 
strength lies in its recursive character: assignment of a 
variable from a 1-clause may create further 1-clauses, and 
lead to further assignments. Hence, the propagation of log- 
ical constraints is very efficient to reduce the size of the 
instance to be treated. In the absence of unit-clause, some 
choice has to be made for the variable to assign and the 
truth value to give. This choice is usually made according 
to some heuristic, the simplest one being 

Random (R) heuristic: When the instance in- 
cludes NO unit-clause, pick any unset variable 

UNIFORMLY AT RANDOM, AND SET IT TO TRUE OR FALSE 
WITH PROBABILITIES ONE HALF. 

The specification of the heuristic e.g. R, together with UP, 
entirely defines the randomized search algorithm The 
output of the procedure is either 'Solution' if a solution 
is found, or 'Don't know' if a contradiction (two oppo- 
site unit-clauses) is encountered. The probability that the 
procedure finds a solution, averaged over the choices of in- 
stances with ratio a, was studied by Chao and Franco j7| 
Frieze and Suen with the result 



lim Psi 

V^oo 
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Fig. 1. Scaling of the probability of success of randomized 
search algorithm based on a heuristic H and the UP rule for 
the 2-|-p-SAT model with instances of size and having a 
clauses per variable. The dynamic (or kinetic) critical line is 
represented for the R heuristic, q:r(p) — 1/(1 — p) for p < |, 
24p/(2-|-p)^ for p > 2 separates the failure phase (upper region) 
from the success phase (lower region). Along the critical line, 
^success decays as a power law for p < § with a p-dependent 
exponent y{p) = (1 — p)/(2 — 5p)/6, and as a stretched expo- 
nential with exponent ^ when p > |. The static critical line 
(dashed curve) coincide with the dynamic one for p < § and 



(2) lies above for larger values of p, see Section f2. II 



for Random 3-SAT, R heuristic & UP. The above study 
was then extended to more sophisticated and powerful 
heuristic rules H, defined in Section It appears that 
identity (O holds for all the randomized algorithms based 
on UP and a heuristic rule H, with a critical ratio value 
an that depends on H. Cocco and Monasson then showed 
that, for ratios above aH, the probability of success is 
indeed exponentially small in N, in agreement with the 
generally expected behaviour 

The transition from success to failure in random 3-SAT 
was quantitatively studied in a recent letter [HI . We found 
that the width of the transition scales as N~3, and that 
the probability of success in the critical regime behaves 
as a stretched exponential of N with exponent i. More 
precisely, 



aH{l + eN-T>),N ^exp(~NB <P{e) [1 + o(l)] 



(3) 

for Random 3-SAT, and UP. The calculation of the scaling 
function <P relies on an accurate characterization of the 
critical distribution of the number of unit clauses, for an 
exact expression see eq. The important statement 

here is that the above expression holds independently of 
the heuristic H, provided the randomized procedure obeys 



^ This procedure is called the Unit-Clause algorithm in the 
computer science literature 1516! . 



UP This result allowed us to evoke the existence of a 
universality class related to UP. 

In the present paper, we provide all the calculations 
which led us to equation (PJ. We also calculate subdom- 

inant corrections to the Ne scaling of InPguccoss in Qi 
which allow us to account for numerical experiments in a 
more accurate manner than in reference (see SectionlHI) . 
In addition, we study the robustness of the stretched expo- 
nential behaviour with respect to variations in the prob- 
lem to be solved. We argue that the class of problems 
to be considered for this purpose is random 2-|-p-SAT, 
where instances are mixed 2- and 3-SAT instances with 
relative fractions p and I — p (for fixed < p < 1) |10| . 
Our results are sketched on Figure ^ It is found that the 
stretched exponential behaviour holds for a whole set of 
critical random 2-fp-SAT problems, but not all of them. 
More precisely, equation Q remains true for p > 2/5. For 
p < 2/5, the probabihty of success at criticality decreases 
as a power law only. 



aR{p){l + eN-^),N 



[1 + 0(1)] (4) 



* Strictly speaking, <P depends on H through two global mag- 
nification ratios along the X and Y axis: ^//(e) = yn^ixn 
where xhjVh are heuristic-dependent real numbers and $ is 
universal. 
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for Random 2+p-SAT, _p < 2/5, and UP. The value of the 
universal decay exponent is j{p) — (1 — p)/{2 — 5p)/6. 
The calculation of the prefactor Ah{p,£) shows similari- 
ties with the one of «?(e) above, but is more difficult and 
shown in Section |31 



2 Definitions and brief overview of known 
results 

2.1 Random X-SAT and 2+p-SAT problems 

In the random if- SAT problem [TTHEl, where K IS an in- 
teger no less than 2, one wants to find a solution to a set 
of M = aN randomly drawn constraints (called clauses) 
over a set of N Boolean variables Xi {i — 1 . . . N): Xi 
can take only two values, namely True [T] and False (F). 
Each constraint reads V V . . . V z;^, where V de- 
notes the logical OR; is called a literal: it is either a 
variable or its negation Xi^, with equal probabilities 
(= and (ii, i2, • • ■ , iK) is a ii'-uplet of distinct integers 
unbiasedly drawn from the set of the (^) if-uplets. Such 
a clause with K literals is called a iiT-clause, or clause of 
length K. Such a set of M clauses involving N variables is 
named an instance or formula of the iiT-SAT problem. An 
instance is either satisfiable (there exists at least one satis- 
fying assignment) or unsatisfiable (contrary case). We will 
be mainly interested in the large N, large M limit with 
fixed a — M/N and K. Notice that the results presented 
in this paper are, or have been obtained for this 'flat' dis- 
tribution only, and do not hold for real-world, industrial 
instances of the SAT problem. 

A distribution of constraints will appear naturally in 
the course of our study: the random 2-|-p-S AT problem ^U] . 
For fixed < p < 1, each one of the M = aN clauses is of 
length either 2 or 3, with respective probabilities 1 —p and 
p. Parameter p allows one to interpolate between 2-SAT 
(p = 0) and 3-SAT (p= 1). 

Experiments and theory show that the probability Pgat 
that a randomly drawn instance with parameters p, a be 
satisfiable is, in the large N limit, equal to (respectively, 
1) if the ratio a is smaller (resp., larger) than some critical 
value ac{p)- Nowadays, the value of ac{p) is rigorously 
known for p < | only, with the result ac{p) = 1/(1 — 
p) Pni, see Figure n For 3-SAT the best current upper 
and lower bounds to the threshold are 4.506 TT and 3.52 
[T5j respectively. For finite but large N, the steep decrease 
of Psat with a (at fixed p) takes place over a small change 
Sa = 0{N~i^) in the ratio of variables per clause, called 
width of the transition. Wilson has shown that the width 
exponent v is bounded from below by 2 (for all values 
of p) TB". For 2-SAT, a detailed study by BoUobas et al. 
establishes that ly ^ 3 ^7], and that Pjat is finite at the 
threshold a = 1. A numerical estimate of this critical Pgat 
may be found in reference jlS] and we provide a more 
precise one m AppendixEl Psat(p = 0,a = 1) = 0.907 ± 
10-3. 



2.2 Greedy randomized search algorithms 

In this paper, we are not interested in the probability of 
satisfaction Pgat (which is a property characteristic of the 
random SAT problem only) but in the probability Psuccess 
(< Psat) that certain algorithms are capable of finding a 
solution to randomly drawn instances. These algorithms 
are defined as follows. 

Initially |19lllllll3] , all variables are unset and all clau- 
ses have their initial length {K in the K-SAT case, 2 or 3 
in the 2-|-p-SAT case). Then the algorithms iteratively set 
variables to the value T (true) or F (false) according to 
two well-defined rules (mentioned in the introduction and 
detailed below), and update (reduce) the constraints ac- 
cordingly. For instance, the 3-clause (xi Va;2 Va;3) is turned 
into the 2-clause (xi Vxs) if X2 is set to T, and is satisfied, 
hence removed from the list of clauses, if X2 is set to P. A 
1-clause (or unit-clause) like xi may become a 0-clause if 
its variable happens to be inappropriately assigned (here, 
to T); this is called a contradiction. In this case, the algo- 
rithms halt and output 'don't know', since it can not be 
decided whether the contradiction results from a inade- 
quate assignment of variables (while the original instance 
is satisfiable) or from the fact that the instance is not sat- 
isfiable. If no contradiction is ever produced, the process 
ends up when all clauses have been satisfied and removed, 
and a solution (or a set of solutions if some variable are 
not assigned yet) is found. The output of the algorithms 
is then 'Satisfiable'. 

We now make explicit the aforementioned rules for 
variable assignment. The first rule, UP (for Unit- 
Propagation) 0, is common to all algorithms: if a clause 
with a unique variable (a 1-clause), e.g. xi, is produced at 
some stage of the procedure, then this variable is assigned 
to satisfy the clause, e.g. xi := F. UP is a corner stone 
of practical SAT solvers. Ignoring a 1-clause means taking 
the risk that it becomes a 0-clause later on (and makes 
the whole search process fail), while making the search 
uselessly longer in the case of an unsatisfiable instance ^. 

Therefore, as long as unit-clauses are present, the algo- 
rithms try to satisfy them by proper variable assignment. 
New 1-clauses may be in turn produced by simplification 
of 2-clauses, and 0-clauses (contradictions) when several 
1-clauses require the same variable to take opposite logical 
values. 

The second rule is a specific and arbitrary prescription 
for variable assignment taking over UP when it cannot be 
used i.e. in the absence of 1-clause. It is termed heuristic 
rule because it does not rely on a logical principle as the 
UP rule. In the simplest heuristic, referred to as random 
(R) here, the prescription is to set any unassigned vari- 

^ Another fundamental rule in SAT solvers that we do not 
consider explicitly here, although it is contained in the CL 
heuristic |15) to which our results apply, is the Pure Literal 
rule 121, where one assigns only variables (called pure literals) 
that appear always the same way in clauses, i.e. always negated 
or always not negated). Removal of pure literals and of their 
attached clauses make the instance shorter without affecting 
its logical status (satisfiable or not). 
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able to r or with probability ^ independently of the 
remaining clauses [SlIHl- More sophisticated heuristics are 
able to choose a variable that will satisfy (thus eliminate) 
the largest number of clauses while minimizing the odds 
that a contradiction is produced later. Some examples are: 

1. GUC (5j (for Generalized Unit Clause) prescribes to 
take a variable in the shortest clause available, and 
to assign this variable so as to satisfy this clause. In 
particular, when there are no 1-clauses, 2-clauses are 
removed first, which decreases the future production 
of 1-clauses and thus of contradictions. 

2. HL (for Heaviest Literal) prescribes to take (in ab- 
sence of 1-clauses, as always) the literal that appears 
most in the reduced instance (at the time of choice), 
and to set it to T (by assigning accordingly its vari- 
able), disregarding the number of occurrences of its 
negation or the length of the clauses it appears in. 

3. CL (for Complementary Literals) prescribes to 
take a variable according to the number of occurrences 
of it and of its negation in a rather complex way, 
such that the number of 2-clauses decreases maximally 
without making the number of 3-clauses too much de- 
crease. 

4. KCNFS |2Qj is an even more complex heuristic, spe- 
cially designed to reduce the number of backtrackings 
needed to prove that a given instance is unsatisfiable, 
on top of standard tricks to improve the search. 



2.3 The success-to-failure transition 



success 



Chao and Franco have studied the probability Pgi 
that the randomized search process based on UP and the 
R heuristic, called UC (for unit-clause) algorithm, suc- 
cessfully finds a solution to instances of random 3-SAT 
with characteristic parameters a,N [H], with the result 
((21). This study was extended by Achlioptas et al. to the 
case of random 2-|-p-SAT 13^, with the following outcome: 



lim R 



Af— >oo 



success 



> if a < OiB^ip) 
= if a > q;r(p) 



(5) 



for the R heuristic and UP, where 



1 2 

aR(p) = if P<T 

I — p 5 



Hence, as simple as is UC, this procedure is capable to 
reach the critical threshold ac'{p) separating satisfiable 
from unsatisfiable instances when p < |. For p > |, a 
finite gap separates aB.{p) from acip) in which instances 
are (in the large N limit) almost surely satisfiable but UC 
has a vanishingly small probability of success. 

Similar results were obtained for the heuristics H listed 
above, especially for p = 1 (3-SAT). The smallest ratios, 
called thresholds and denoted by an, at which Psucccss 
vanishes in the infinite N limit are: ccguc — 3.003 0, 
auL ^ 3.42 U^, acL ^ 3.52 [Tg. The 3-SAT threshold 
for the KCNFS heuristic is not known. 

In this paper, we are interested in the critical scaling 



or very close to its critical and heuristic-dependent value 
an (at fixed p) . More precisely, we show that Psucccss may 
vanish either as a stretched exponential JSjl or as an inverse 
power law |@J. Strikingly, although the randomized search 
algorithms based on different heuristics exhibit quite dif- 
ferent performances e.g. values of an, we claim that the 
scaling of -Psucccss at criticality is essentially unique. The 
mechanism that monitors the transition from success to 
failure at an of the corresponding algorithms is indeed 
UP. For instance, for KCNFS, a numerical study shows 
that the special, complex heuristic is never used when a 
is close to the aKCNFS threshold of this algorithm. 

Hereafter, we show that universality holds for random 
K-SAT with > 3 and for 2-hp-SAT with p > | one the 
one hand, and p < | on the other hand. In the p < | 
case, for which an and ac coincide (Sect. Isj, there is 
strictly speaking an infinite family of universality classes, 
depending on the parameter p (in particular, the critical 
exponent Jh{p) — see Q — varies continuously with p). 
These analytical predictions are confirmed by numerical 
investigations. 



3 Generating function framework for the 
kinetics of search 

This section is devoted to the analysis of the greedy UC= 
UP-I-R algorithm, defined in the previous section, on in- 
stances of the random 2 -|-p-SAT or K-SAT problems. We 
introduce a generating function formalism to take into ac- 
count the variety of instances which can be produced in 
the course of the search process. We shall use b{n; m, q) = 
(™) 9™ (1 ~ '?)'""" to denote the Binomial distribution, and 
^n.rn to represent the Kronecker function over integers n: 



of P 



success 



with N, that is, when a is chosen to be equal, 



{1 if n = TO, otherwise}. 



3.1 The evolution of the search process 

For the random 2 -I- p-SAT and K-SAT distributions of 
boolean formulas (instances), it was shown by Chao and 
Franco |7||S] that, during the first descent in the search tree 
i.e. prior to any backtracking, the distribution of resid- 
ual formulas (instances reduced because of the partial as- 
signment of variables) keeps its uniformity conditioned to 
the numbers of ^-clauses (0 < ^ < K). This statement 
remains correct for heuristics slightly more sophisticated 
than R e.g. GUC, SCi 5„6i7,, and was recently extended 
to splitting heuristics based on literal occurrences such as 
HL and CL Therefore, we have to keep track only 
of these numbers of -^-clauses instead of the full detailed 
residual formulas: our phase space has dimension K -\- 1 
in the case of K-SAT (4 for 2 + p-SAT). Moreover, this 
makes random 2-|-p-SAT a natural problem. After partial 
reduction by the algorithm, a 3-SAT formula is turned 
into a 2 -|- p-SAT formula, where p depends on the steps 
the algorithm has already performed. 

Call P(C;T) the probability that the search process 
leads, after T assignments, to a clause vector C — (Cq, 
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Ci, C2, . . . , Ck)- Then, we have 

P{C';T+1) = ^M[C' ^C;T] P{C;T) (7) 



c 



where the transition matrix M is 



M[C' ^ C; T] = (1 - Scfi) Mup[C" ^ C; T] + 

5ci,oA/r[C'^C;T] . (8) 

The transitions matrices corresponding to unit-propaga- 
tion (UP) and the random heuristic (R) are 



Mx[C' ^ C;T] 



Ck 

E 



b[zK;CK,K^i] 



b[rK-i;zK, -] Sc'^^Ck-^k 



X E 

Ck-1 

X ^ b[zK-i;CK^i,iK - 

Zk-1 



X 



J2 b[rK-2;ZK-l,h Sc'^_^. 



Ck-1— zk -1+1" K-i 



rK-2=0 

C2 

X ... X ^ b[z2;C2,2^] 

Z2=a 



(9) 



Z2 ^ 



>C'C2-Z2+r2 



ri=0 

where ^ := 



N-T 



and, for X=UP and R, 



Ci-l 



zi=0 

xE^ 

ro=0 



(10) 



The above expressions for the transition matrices can 
be understood as follows. Let A be the variable assignment 
after T assignments, and the residual formula. Call C 
the clause vector of Assume first that Ci > 1. Pick 
up one 1-clause, say, £. Call Zj the number of j-clauses 
that contain £ or £ (for j = 1,2,3). Due to uniformity, 
the Zj's are binomial variables with parameter j /{N — T) 
among Cj — 5j^i (the 1-clause that is satisfied through 
unit-propagation is removed). Among the Zj clauses, 
contained I and are reduced to (j — l)-clauses, while the 
remaining Zj — rj_i contained £ and are satisfied and re- 
moved. Tj-i is a binomial variable with parameter 1/2 
among Zj. 0-clauses are never destroyed and accumulate. 
The new clause vector C is expressed from C and the 
Zj+iS, rj^s using Kronecker functions; thus, AIp[C' , C; T] 
expresses the probability that a residual formula at step T 
with clause vector C gives rise to a (non violated) residual 



instance at step T + 1 through unit-propagation. Assume 
now C'l = 0. Then, a yet unset variable is chosen and set 
to T or F uniformly at random. The calculation of the new 
vector C is identical to the unit-propagation case above, 
except that zi = rg = (absence of 1-clause). Hence, 
putting both Ci > 1 and Ci = contributions together, 
Af[C ^ C;T] expresses the probability to have an in- 
stance after T + 1 assignments and with clause vector C" 
produced from an instance with T assigned variables and 
clause vector C. 



3.2 Generating functions for the numbers of clauses 

It is convenient to introduce the generating function 
G( X ; T ) of the probability P{C;T) where 

X :— {Xq, Xi,X2, . . . , Xk), 

G{X -T) ■.= Y^ X^" X^\..X^'' P{C ,T). 
c 

Evolution equation |2J for the P's can be rewritten in 
terms of a recurrence relation for the generating func- 
tion G, 

G{X-T+l) = ^G{XoJi,f2,...jK\T)+ (11) 
h 

1 



1-- ) G(Xo,0,/2,/3,...,/k;T^ 



where /i , . . . , fa stand for the functions 



f,{X-T) ■.= X, 



N-T 



-X, 



(12) 



(j — l,...,if). Notice that probability conservation is 
achieved in this equation: G(l, 1, . . . , 1; T) = 1 for all T. 

Variants of the R heuristic will translate into addi- 
tional contributions to the recurrence relation l|ll|) . For 
instance, if the algorithm stops as soon as there are no 
reducible clauses left (Gi = C2 = ■ ■ ■ — Ck = 0) in- 
stead of assigning all remaining variables at random (such 
a variation is closer to what is used in a practical search 
algorithm), the transition matrix is modified into 

M\C' ^ C; T] = (1 - <5ci.o) Mup[C' ^ C; T] + 
<5ci,o(l - <5c2,o'5c3,o ■ ■■^Ck.'a) A/r[C' ^ C;T] (13) 
and equation (|ll|l becomes 

G(X;T+1) = lG(Xo,/l,/2,...,//^;T) + 
/l 



1-- ) G(Xo,0,/2,/3,...,/k;T] 



-G(Xo,0,0,...,0;r) 



(14) 



In this case, G is not normalized any longer; 
G(1, 1, . . . , 1; T) is now the probability that search has 
not stopped after assignment of T variables. One could 
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also impose that the algorithm comes to a halt as soon as 
a contradiction is detected i.e. when Cq gets larger than 
or equal to unity. This requirement is dealt with by set- 
ting Xq to in the evolution equation for G (fTT|l , or ^^T^ . 
All probabilities are now conditioned to the absence of 
0-clauses, and, again, G is not normalized. 

For the more complicated heuristic GUC (without stop- 
ping condition), the recurrence relation reads 

G(X;T + 1) = 1g(Xo,/i,/2,...,/k;T) (15) 
Ji 



G(Xo,0,0,h,---jK;T^ 





1 






1 




( 1 



1 



Jk /a'-i 



G Xo,0,0,...,0,/i^;T 



The above recurrence relations ((TT)) . ((T3|l ... will 

be useful in subsection 3.4 to derive the distribution of 
unit-clauses. As far as j-clauses are concerned with j > 2, 
we shall see in subsection 13.31 that, thanks to self-avera- 
geness in the large N limit, it is sufficient to know their 
expectation values, {Gj){T). The average number of j- 
clauses is the derivative, evaluated at the point Xq = Xi = 
. . . = Xk = 1, of the generating function G with respect 
to Xj: (Cj){T) = dx, lnG(l, 1, . . . ,1;T) ^ . Evaluating the 
derivative at another point is used to take conditional av- 
erage: for instance, the average of Gi conditioned to the 
absence of 0-clauses is (Gi)(r) = dx, lnG(0, 1, . . . , 1; T) 
(here, G(0, 1, 1, . . . , 1; T) may be less than 1 as we have 
seen). Taking derivatives with respect to more than one Xj 
would give information about correlation functions and/or 
higher order moments of the Gj 's. 

For evolution equation the system of evolution 

equations for the {Cj){T)'s is triangular: 



(G,)(T+1)-(G,)(T) 



] 



N-T 



1 J + 1 

2N -T 



(16) 

(G,+i)(r) if2<j<K 



(Gi)(r + i)-(Gi)(r) = - 



1 



N-T 



{Ci){T) 



(17) 



+ j^{C2)iT) + (1 - j^){^{Sc,,,)iT) 1) 
(Go)(r+l)-(Go)(r) = 

^ [{C,)iT)-l + (5c,,o){T)) = 



(18) 



2{N -T) 
1 

2{N -T) 



(max(Gi-l,0))(r) 



(with Ck+1 ■= 0) and it can be solved analytically, start- 
ing from {Ck){T) down to {G2){T), with the initial condi- 
tion Cj (0) = aNSj^K (a is the initial clauses-per- variables 
ratio). However, the equations for (Gi)(T) and (Go)(T) 
involve more information than the averages of the G^-'s, 

^ The logarithm plays no role when G is normalized, i.e. 
G(l,l,...,l;r) = l. 



namely the probability that there is no 1-clauses, and they 
can't be solved directly: we shall study the full probability 
distribution of Gi in the sequel, in order to extract the fi- 
nally useful information, that is the probability {Sco,o){T) 
that the search process doesn't fail (doesn't detect any 
0-clause or contradiction) up to step T of the algorithm. 

Before going on, let us point out that at least two 
strategies are at hand to compute this finally useful quan- 
tity. We just explained one: we set Xo ~ Xi = 1 in G, 
compute the averages of the Gj 's {j > 2) (these stochastic 
variables turn out to self-average in this case where Gq and 
Gi are free — see below) , compute the full distribution of 
Go and Gi conditioned to the averages of the Gj 's (j > 2), 
and finally extract the probability that Go vanishes up to 
time T. The other one starts with noticing that this last 
probability is nothing other than G(0, 1, 1, . . . , 1; T): thus, 
it seems more natural to compute it through studying the 
generating function G(0, Xi, X2, . . . , Xk] T) with Xo set 
to 0, i.e. to condition all probabilities and averages on 
the success of the search, or equivalently to require the 
process to stop as soon as a contradiction appears. But 
this would prevent us to solve the evolution equations for 
the (Gj)(T)'s and would finally lead to more complica- 
tion: indeed, in such a case, since G is not normalized, 
the quantity G(0, 1, 1, . . . , 1; T), that expresses the prob- 
ability that no contradiction is found, and that can't be 
expressed without information about Gi , appears in every 
equation — or, put in another way, there are correlations 
between all G^ 's and Gi. Therefore, we prefer to take the 
seemingly less direct first route and study from now on 
only the simplest kinetics (|11() . with UC heuristic and no 
stop condition. 

Another way of circumventing this problem could be 
to do a kind of coarse-graining by grouping steps of the 
algorithm where 1-clauses are present (and the Unit Prop- 
agation principle is used) into so-called rounds |2llll5) . and 
then do as if the rounds where the elementary steps: at the 
end of each step, Gi is always vanishing, so that one needs 
to keep track only of the Gj's, j > 2, in the coarse-grained 
process. 

3.3 On the self-averageness of clause numbers and 
resolution trajectories 

Is the knowledge of the sole averages (Gj)(T) enough, at 
least for 2 < j < if, to compute the success probability of 
the search process? The answer is yes, in a large range of 
situations, because the Gj's are self-averaging (for j > 2). 

It may be shown rigorously using Wormald's the- 
orem 122], that, with the kinetics defined above and no 
constraints on the Gj's (i.e. with all Xj set to 1), G2, G3, 

Gk are self-averaging in the T,N +00 with fixed 
t := T/N limit in such a way that we can approximate 
them by continuous functions of the reduced parameter 
i e [0,1]: 



Gj (T) = Ncj (t) + o{N - T), 2<j<K 



(19) 



where o{N — T) is actually an asymptotically Gaussian 
fluctuation term, i.e. a stochastic variable with average 
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0(1) and standard deviation 0{VN - T) (TV - T is the 
number of not-yet-assigned variables). The self-averageness 
of the Cj 's is a consequence of the concentration of their 
variations [H]: given C{T), the variation terms ACj := 
Cj{T +1) - Cj{T) for j > 2, eq. (113 are concentrated 
around constant averages, and these averages {ACj) may 
be approximated by continuous functions gj(C2/{N — T), 
C3/{N-T), . . . , CK/{N-T),t) with errors o(l). However, 
the S term in eq. H17|l and the max term in eq. H18() are not 
smooth and prevent the existence of continuous functions 
gi and go. This has deep consequences, since the distri- 
bution of Ci is found to be broad (in the large TV limit, 
the standard deviation is not negligible w.r.t. the average, 
but of the same order of magnitude). We conclude as in 
previous subsection that we shall be obliged to study the 
full distribution of Ci and Cq. 

Eq. H19() ensures that we can safely replace the val- 
ues of the Cj's, j > 2, with their averages in the large 
N,T limits. Let £{T) be a probabilistic event at step 
T, such as: 'the search detects no contradiction up to 
step T'. We divide the space C2(T'), C3(r'), . . . , Ck{T'), 
< T' < T into a tube C centered on the average trajec- 
tory (C2)(T'), . . . , {Ck){T') and into its exterior C. The 
probability of £{T) then reads: 



L[£{T)] = E[£{T) nC]+ E[£{T) n C] 



(20) 



and we shall choose the size of C so that the second term 
is negligible with respect to the first one, i.e. to the prob- 
ability of £{T) conditioned to the (Tj 's lying close to their 
averages at all times < T' < T. 

Fix 2 < j < X and 1/2 < S < 1. At time < 
T' < T, if the asymptotic standard deviation of Cj{T') is 
aj{T')\/N — T, the probability that the discrete stochas- 
tic variable Cj (T') lies away from its average by more than 
aj{T'){N ~ TY , or equivalently that the (asymptotically) 
continuous stochastic variable Cj{T')/[aj{T')y/ N — T] lies 
away from its average by more than (iV — T)''^^/^, is 



A{T') := 2 



da; 



+0 



+00 

(Af-T)'S-i/2 \/2tt 

2 

2^(7V„T)*-i/2^ 



{N - r)3'5-3/2 



(21) 



Although the value of (Jj{T') depends on T', it varies only 
smoothly with the reduced parameter t — T/N and it 
makes sense to use a single exponent d to define the region 
C. The probability that Cj{T') stays close to the average 
trajectory up to aj{T'){N - Tf from T' = to T is then 



A :^\{ (l-Z\(T')) = l-0(iV3/'-V^""/') . (22) 

T'=0 

Generalizing this to the parallelepipedic region C with 
boundaries such that each Cj{T'), 2 < j < K, is always 



at the distance at most aj{T'){N — T)* from its average, 
we find that the measure of C is 

E[C] = (1 - Z\)^-i = 1 - o(iV3/2-5e-JV^^-V2) (23) 
and the complementary measure is E[C] = 1 — E[C] so that 



E[£{T)] = E[£{T) n C] +o(n^^^-^ 



-V2 



(24) 



where the second term vanishes as N gets large, as we 
wished. 

Finally, let us draw the scheme of the computations to 
follow: any trajectory of C2(T), . . . , Ck{T) inside C brings 
a contribution to E[£(r) n C] that lies close to the con- 
ditional average E[£{T)\C2{T) = Nc2{t), . . . ,Ck{T) = 
NcK{t)] by an relative error at most 0{N'^) in any di- 
rection {+C2, —C2, ■■■),( being computed later, together 
with the conditional average (it depends presumably on S). 
Thus, we can approximate the total contribution E[£(r)n 
C] with 



1 - O (a^3/2^ 



E[£{T)\C2{T) = Nc2{t),...,CK{T)^NcK{t)] x 
[l + 0(iV?)] (25) 

to get 

E[£{T)] = 

E[£{T)\C2{T)^Nc2{t),...,CK{T)=NcK{t)\ 

X [l + C'(iV^)] 
+0 (7V3/2-5e-^"-V2) (26) 

where we shall have to ensure (by a proper choice of 5 
if possible) that the neglected terms are indeed negligi- 
ble with respect to the computed conditional expectation 
value: if 5 is too large, the weight of the region C is very 
small, but we allow deviations from the average and typ- 
ical trajectory inside the (too loose) region C that may 
bring contributions substantially different from the typi- 
cal one. Conversely, if we group into C near the typical 
trajectory only the most faithful trajectories, we have a 
good control over the main contribution, but the weight 
of the 'treaters' in C may not be negligible any more. 

The self-averaging of Ci and Cq (or its lack) has con- 
sequences that may be observed numerically. Let us study 
the distribution over instances of the probability P that 
the UC=UP-|-R greedy, randomized algorithm detects no 
contradiction during its run. That is, for each of the « 
4000 instances that we draw at random, we do 10** to 10^ 
runs of the algorithm (with different random choices of 
the algorithm) and we estimate the probability of success 
of the algorithm on this instance ^. 



^ Alternatively, we could get the same result by doing one 
run of the algorithm on each instance (and averaging over many 
more instances) since the sequence 6 of choices of the algorithm 
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The cumulative distribution function of P is plotted 
in figure 121 for instances of 3-SAT with initial clauses-per- 
variable ratio a = 2 (left curves) and 8/3 (right curves), 
for sizes of problems N = 1000 and 10 000. For each size 
N, P is rescaled to fix the average to and the standard 
deviation to l/\/2. For a < 8/3, Ci has finite average 
and standard deviation when N — > +oo, and ACi may be 
approximated by a continuous function gi like the Z\Cj's 
for j > 2 (see Sect. 14. 1|) . The numerical distributions of 
P are successfully compared to a Gaussian distribution 
(the average of P for a < 8/3 is computed in Sect. 14.11 
see Eq. (0Hll)- For a — 8/3, things are different. Ci has 
average and standard deviation of the order of N^^^ (see 
Sect. 14.21 and following). As for a = 2, the width of the 
finite-size distributions of P vanishes with N — they con- 
centrate about their average (computed in Section (BJ see 
Eq. H123(l ^. and the rescaled finite-size distributions of P 
are numerically seen to converge to a well-defined distri- 
bution. However, this distribution is not Gaussian — this 
effect seems rather small, but significant. 
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Fig. 2. Numerical estimates for the cumulative probability 
distributions of the probability P of success of the UC=R-|-UP 
algorithm on random 3-SAT instances with sizes N = 10^ 
(dots) and 10'' (dashes) and initial clauses-per-variable ratios 
a = 2 (left) and 8/3 (right). The P-axis for each distribution 
(each value of a and A^) is chosen so that the rescaled distribu- 
tions have average and standard deviation l/\/2. The solid 
lines show the Gaussian cumulative distribution [1 -|- erf (a;)]/2. 
For a — 2, numerical distribution are in good agreement with 
a Gaussian shape, but not for the critical ratio a = 8/3. 



on the instance A is the same as the sequence a of choices 
of the algorithm on an instance B obtained by relabeling the 
variables and the clauses of instance A. However, this technique 
was slower in practice because much time is spent building new 
instances. 



If we now plug the self-averaged form 1)19(1 of the Cj's, 
J > 2, in their evolution equations H16|l . we get, using the 
reduced parameter t = T/N, 



dcj_ 
dt 



J 



l-t 



c,(t) + 



J + 1 

2(1 -t) 



=j+i(i) 2<j<K 
(27) 

with ck4^i := 0. This triangular system of equations, with 



the initial conditions Cj(0) 
given K. For instance, for 2 
the solution reads 



: a5j_K: is easily solved for 
-p-SAT and the R heuristic. 



c^{t) = ap{l - tf 

C2{t) ^a{?,pt/2 + l-p){l-tf 



(28) 
(29) 



whereas, for 2 -t- p-SAT and the GUC heuristic, 

c:i{t) = ap{l - tf (30) 
C2{t) = {l-t) {ln(l -t)+ a[3pti2 ~ t)/A + (1 - p)]} . 

(31) 

For K-SAT with the R heuristics with initial ratios aj = 
C,/N, 

C2(0 = (l-0'E«.j(j-l)2^-^i^-' • (32) 

The parametric curve [c2{t), C3(i), . . . , c_R-(t)], < t < 
1, will be called resolution trajectory. It describes, forget- 
ting about 0- and 1-clauses, the typical evolution of the 
reduced formula under the operation of the algorithm. In 
the case of 3- or 2 -I- p-SAT, a useful alternative phase 
space is the px a plane, where a = (c2+ C3)/(l — t) is the 
(instantaneous) 2-or-3-clauses-per-variable ratio and p is, 
as usual, the proportion of 3-clauses amongst the 2- and 3- 
clauses. Some resolution trajectories of the UC algorithm 
are shown on figureOl They all end at the ci = C2 = C3 = 
point with t — 1: almost all variables have to be assigned 
before no clauses are left (and a solution is found). More 
clever algorithms such as GUC are able to find solutions 
with a finite fraction of remaining unset variables, and 
give a family of solutions with a finite entropy at once. 



3.4 Reduced generating functions for 0- and 1-clause 
numbers 

From now on, we identify C2, C3, . . . , Ck with their asymp- 
totic averages NCj{t) as discussed above, and study the 
kinetics of Co and Ci as driven by C2, C3, . . . , Ck- Under 
these assumptions, it is easy to write the evolution equa- 
tion for Co and Ci that corresponds to I7J|: if 
P(Co, Ci; r|C2) is the probability that the search pro- 
cess leads, after T assignments and with imposed values 
of C2, C3, . . . Ck at all steps to iV, to the numbers Co, 
Ci of 0-, 1-clauses, 



P(C^,C;;r-M|C2) = 
EEAfi[C^,Cl 



(33) 

Co,Ci;T|C2] P(Co,Ci;T|C2) 



Co Ci 
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where the expression of Mi is deduced from that of M by 
canceling all what it written on the left of the sum over Z2 
in It is readily seen in that expression that, actually, 
the transition matrix Mi doesn't depend explicitly on C3, 
C4, Ck but only on C2; therefore, we dropped the 
unnecessary dependence in the above equation. Mi also 
depends explicitly on time through /i = 1/ (N — T) . This 
evolution equation translates into the following equation 
for the generating function 



M 



M 



Goi(Xo,Xi;T|C2) := ^ ^X^^^Xf ^P(C7o, Ci; TlCz) 



Co=0 Ci=0 



Goi{Xo,Xi;T+l\C2)= 1 



Xi-1 
N -T 



C2{a,T) 



X (34) 



■Goi(Xo, /i; TIC2) 



1 

Ti 



Goi(Xo,0;r|C2) 



where /i = ^+-^» 



2i^i^_T) I " ^1 (1 ^ ) ' OH • Since the Xq 
argument of Gqi is the same in all terms, we shall drop 
it when there is no ambiguity and use the lighter nota- 
tion Gi(Xi;r|G2) := Goi(Xo, Xi; TIG2). The key equa- 
tion above yields the main results of the next Sections: 
in particular, Goi(0, 1; TIG2) is the probability that the 
search process detected no contradictions (i.e. produced no 
0-clauses, even if there are already contradictory 1-clauses 
such as 'a' and 'a') while assigning the first T variables, 
and Goi(0, 1]T — 7V|G2) is the probability that a solution 
has been found by the search process, i.e. that all N vari- 
ables were assigned without production of a contradiction 
(if all variables were assigned, any produced contradiction 
was necessarily detected). 



4 The probability of success 

In this section, the generating function formalism is used 
to study the probabihty -Psuccoss(q:,p, N) that UC success- 
fully finds a solution to a random instance of the 2 -\- p- 
SAT problem with TV variables and aN clauses. We first 
consider the infinite size limit, denoted by Psucccss(q^,p)- 
As explained in Section 12.31 the probability of success 
vanishes everywhere but for ratios smaller than a criti- 
cal value a^{p). This threshold line is re-derived, with a 
special emphasis on the critical behaviour of Psucccss when 

We then turn to the critical behaviour at large but 
finite N, with the aim of making precise the scaling of 
-Psucccss(a,P, ^) with N. A detailed analysis of the be- 
haviour of the terms appearing in the evolution equation 
for the generating functions G is performed. We show 
that the resulting scalings are largely common to all algo- 
rithms that use the Unit Propagation rule. 



4.1 The infinite size limit, and the success regime 

For a sufficiently low initial clauses-per-variables ratio a, 
the algorithm finds a solution with positive probability 



when N ^ 00. It is natural to look for a solution of equa- 
tion (|34|) with the following scaling: 



Gi{Xi,T^tN\C2)=MXi,t)+o{l) 



(35) 



when T,N ^ +00, ttq being a smooth function of Xi 
and t {Xq is kept fixed to 0). 7ro(l,t) is therefore, in the 
N +00 limit, the probability that the search process 
detected no contradiction after a fraction t of the variable 
has been assigned. The probability of success we seek for 

is -Psucccss = 7I'o(l, !)■ 

We furthermore know that G2 , which drives the evolu- 
tion of Gi in (|34() . is concentrated around its average: we 
take 

G2(a, r = W) (A^ - T)d2{a, t) + O [{N - T)^] (36) 
with 



d2{t) ■.^C2{t)/{l-t) 



(37) 



and 1/2 < (5 < 1 will be chosen later. Inserting the above 
Ansatze into (|34f) yields 



x[ 



[7To{Xl,t) + {Xl-l)TTo{0,t)] 



+0 [(1 - Xi)(N - T^-^] +0[{N- T)-^] (38) 

hence, in the thermodynamic limit A^, T — > +00, an equa- 
tion for ttq, 

g(Xi-l)d2(t) 

^0(^1, t) = 77 [MXi,t) + (Ai - 1) 7ro(0,i)] • 

This equation does not suffice by itself to compute 7ro(l, i). 
Yet it yields two interesting results if we differentiate it 
w.r.t. Xi to the first and second orders in Ai = 1: 



7ro(0,t) 



{l-d2{t)) 7ro(l,t) 



dXi 



d2{t){2 - d2{t)) 
2{\-d2{t)) 



^0(1, i) 



(40) 
(41) 



Under assumption 7ro(0, t)/7ro(l, i) = 1 — d2[t) can 

be interpreted as the probability pi (t) that there is no 1- 
clause at time t conditioned to the survival of the search 
process. d2{t) is then the (conditional) probability that 
there is at least one 1-clause As pi{t) has to be posi- 
tive or null, d2{t) cannot be larger than 1. As long as this 
is ensured, pi{t) has a well-defined and positive limit in 
the N +00 limit (at fixed reduced time t). The condi- 
tional average of Gi, 9xi7ro(l, i)/7ro(l, t), can be expressed 
from H41|l and is of the order of one when A^ +00. The 
terms of the r.h.s. of 1)17(1 compensate each other: 1-clauses 
are produced from 2-clauses slower than they are elimi- 
nated, and do not accumulate. Conversely, in the failure 
regime (Sect. I^JJ, 1-clauses accumulate, and cause con- 
tradictions. 



* And the 5 term that appears in 11711 is actually a continuous 
function of C-i/iN - T) so that JTHJ holds also for j = 1. 
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To complete the computation of 7ro(l,i), we consider 
higher orders in the large N expansion of Gi. In gen- 
eral, this would involve the cumbersome fluctuation term 
O [{Xi - 1){N - r)*-i] , but, at Xi = 1, only the 'deter- 
ministic' O [{N — T)~-^] correction is left since C2 disap- 
pears from equation (|34|l . Thus we assume 

Gi(l,T = tN\C2) = 7ro(l,i) + ^^^ ^i(l,i)+o 



which yields, when inserted into (|34|l . 



(42) 



diTQ 



dXi 



(l,t) + 7ro(0,i)-^o(l,t) 



.(43) 

This equation 14311 can be turned into an ordinary differ- 
ential equation for 7ro(l,t) using (|4()|l and (|41|l : after inte- 
grating over with the initial condition 7r(l,i = 0) = 1 
i.e. no contradiction can arise prior to any variable setting, 
we find the central result of this section [^13| : 



7ro(l, t) = exp 



dr 



4(1 -r) l-d2(T) 



(44) 



which is finite if, and only if, d2{T) < 1 for all < t < 1 
(the apparent divergence at r = 1 is in practice compen- 
sated by the factors involving ^2). 

The above result can be used in 14()|l and l|39|l to com- 
pute 7ro(Xi, t), that is the generating function of the prob- 
ability P{Ci,t) that there are Ci 1-clauses and no contra- 
diction has occurred after assignment of a fraction t of the 
variables, 

MXi^t) := J2 P{Ci,t)X^' 

Ci>0 

As long as d2{t) < 1, the average number of 1-clauses 
{Ci){t) is finite as iV — -l-oo. This sheds hght on the finite- 
ness of PsuccGss. The probability of not detecting a contra- 
diction at the time step T T+1 is {l-^/2)"'''''^^^-^-°'^ ~ 
1 — max(Ci — l,0)/2/{N—T), and Psucccss is the product 
of 0{N) quantities of that order ^. 

The validity condition d2{t) — C2{t)/{1 ~ t) < 1 is 
fulfilled at all steps t if, and only if, the initial clauses- 
per- variable ratio a is smaller than a threshold, a^, as 
can be seen from the expression of C2{t) = {C2){T)/N 
that results from equation . Graphically, in the (p, a) 
plane, the resolution trajectory in Figure 01 stays below 
the line a(l — p) = 1 iff. a is small enough. Finding the 
threshold value for a and a given p is an easy ballistic 
problem: 

^ We can't go further and compute a function tti (Xi , t) cor- 
responding to the order in (I35II . since the 'Gaussian fluc- 
tuations' term O [(N - Tf] in JHSJl would dominate the 1/N 
introduced correction — only for Xi = 1 is this 1/A'^-term rel- 
evant, so that we could write down H43^ . It is also impossible 
to compute 7ri(l,t) alone. 



— If p < 2/5 ('2-SAT family'), whatever the initial clauses- 
per-variable a, the resolution trajectory (figure|3Jl will 
always either be entirely below the d2 — 1 line (suc- 
cess case, low a), or cut it (failure case, high a). The 
threshold value of a is reached when the resolution 
trajectory starts exactly on it (critical case), therefore 



a^{p) = \/{\-p) ifp<2/5. 



(46) 



If p > 2/5 ('3-SAT class'), the resolution trajectory 
for low a is also entirely below the o?2 = 1 fine (suc- 
cess case). This situation ends when the resolution 
trajectory gets tangent to the d2 = I line, whereas 
for p < 2/5 it was secant. All critical trajectories for 
p > 2/5 share the support of the critical trajectory 
for p = 1 (3-SAT) that starts at (p = 1,q; = 8/3), 
and all become tangent to the 1^2 = 1 fine at the 
{p = 2/5, a — 5/3) point (reached after a finite time), 
whereas for p < 2/5 there are several critical trajecto- 
ries. Here, 



an{p) = 24p/{2 + pf ffp>2/5. 



(47) 



The probability Psucccss = 7ro(l, 1) that the UC algo- 
rithm finds a solution is obtained from equation (|44|) with 
d2it) ^a{l~t){3/2pt+l-p), equation 



-In P., 



i{a,p) 



arctan- 



4^24p/(2 + p)Va- 1 
1 



(48) 



arctan 



V24p/(2+p)2/a-l 
5(p-2/5) 



(2-Hp)V24p/(2 + p)2/a-l 
1 \ 1 



1-p 



8 



ln(l -p) + 



a{p - 4) 
16 



Of particular interest is the singularity of Psuccoss slightly 
below the threshold ratio. At fixed p, as a increases, the 
first singularity is encountered when the resolution tra- 
jectory tangent to the ^2 = 1 hne is crossed i.e. for a = 
24p/(2-|-p)2 (figure|3| largest short-dashed line, and thick 
short-dashed line in the inset). If p > 2/5 (3-SAT class), 
the two arctan in equation H48(l tend to 7r/2 and Psucccss 
vanishes as (e < 0): 



-hiP, 



TT 1 

[(H-e)aR(p),p] = ^^3^ 



e{l). (49) 



For p — 2/5, one of the two arctan vanishes for all a, and 
the first In brings another singularity (e < 0): 



InP 



success 



8 



ln(- 



^0(1). 
(50) 

And for p < 2/5, the two arctan have opposite signs 
so that the first term of equation (|48|l has no singular- 
ity while crossing the 'limiting' resolution trajectory a = 
24p/ (2-|-p)2 (thin short-dashed hne of the inset of Fig.|3I). 



C. Deroulers and R. Monasson: Criticality and Universality in the Unit-Propagation Search Rule. 



11 




--8/3 



0.53 



0.4 0.6 
03/(03 + 03) 

Fig. 3. Some resolution trajectories of the UC=UP+R algo- 
rithm. If the trajectory of some algorithm stays strictly below 
the d2 = a{l—p) = 1 line (long-dashed line), 1-clauses don't ac- 
cumulate and the success probability Psucccss of the algorithm 
is finite as ^ -|-cx3 (solid trajectories). Critical behaviour 
is when the trajectory gets tangent to this line or starts on it 
(short-dashed trajectories). When the trajectory spends time 
in the region above this line (dotted trajectory), Psuccess gets 
exponentially small in TV. Inset: The singular behaviour of 
fsuccess around the (p = 2/5, a = 5/3) point is better un- 
derstood if one uses local coordinates {x,y). x is the distance 
along the common tangent to the 6,2 = 1 line and the critical 
trajectory, y is such that lines of constant y are, in the original 
coordinates, parabolas, tangent to the 6,2 = 1 line. The point 
(2/5, 5/3) is spread onto the waved line. Left and right limits 
of Psuccess for a; ^ on the waved line are well-defined but 
distinct. The A'^ +00 limit of — In Psuccess has the indicated 
singularities on the thick lines. 



A singularity is found when a reaches the 6,2 — ^ line, 
a = 1/(1— p) (thick long-dashed line of the inset of fig- 
ure 13), with the outcome (e < 0) 



InR 



success 



(1 + £)aR.(p),p] = - ^(^ _ In(-e) -1-0(1). 

(51) 

The difference of nature of the singularities between the 
2-SAT and 3-SAT families corresponds to different diver- 
gences of — In Psuccess with N , as will be computed in the 
next section. 

For completeness, let us check that the above calcu- 
lation is compatible with our approximation H26(l . The 
first term of the l.h.s. there is equal to Psuccess plus the 
1/{N — T) correction from equation H42() . The second term 
there, in 0{N''), corresponds here to the fluctuations of 
C2: 0{N^-^) in equation thus ( = (5 - 1. If we take 
for S any value on the allowed interval ]l/2, 1[, the two ap- 



proximation terms in 1)26(1 vanish as A'^ — > +00. Therefore, 
as long as Psuccess IS finite for large N, these approxima- 
tion terms are actually negligible. 



4.2 Large N scalings in the critical regime 

Our previous study of the success regime breaks down 
when c?2(i) reaches 1 during operation of the algorithm. 
Indeed, consider the infinite- A'^ generating function for Ci , 
Gi, given by eq. (|45() . As a function of Xi, Gi vanishes 
uniformly on any compact interval [0, 1 — 77], 77 > 0, as 
^2 — > 1- The Xi = 1 point is singular since normalization 
enforces 7ro(l,i) = 1 (assuming Xq = 1): all useful infor- 
mation is concentrated in a small region around Xi — 1. 
Expanding eq. (|45|l for Ai ^ 1 yields 



7ro(Ai,i) = <l + d2it) 



1-Ai Xidlit) 1-Ai 



l-d2{t) 



l-d2it) 



O 



l-d2{t) 



(52) 



Non-trivial results are obtained when 1 — Ai and 1 — d2{t) 
are of the same (vanishingly small) order of magnitude — 
let us call it A. We suspect that A is some negative power 
of N, to be determined below. Let us define 

Xi=:l-xiA, e2{t) := A-\l - d2{t)) (53) 

so that eq. H3t)|) now reads 

C2[a,T^tN] = [1 ~ e2ia,t)A]iN - T) + O [{N - tY] 

(54) 

where, as previously, the exponent 5 will be tuned later 
according to the framework eq. (|26|l . 

We assume that, in the thermodynamic limit A^ 
cx), Z\ — > but at fixed xi, 62 (i), and time t, the normalized 
generating function of C'l conditioned to the typical value 

0iC2{t), 



Gi{Xi = 1 - xiA,T = tN\C2) 
Gi{l.T^tN\C2) 



^(xi,t) (55) 



where the limit tt is a smooth function of xi and t (which 
also depends on Aq and eo). 'W{xi,t) is the generating 
function for the stochastic variable c := Ci/Z\, conditioned 
to the success of the algorithm. Eq. (|52|l gives information 
about the 62 —f +00 limit of If. Furthermore, eq. (|45|l 
shows that 



A-^Gi{{),T\C2)/Gi{\,T\G2) 



a{t) 



(56) 



a well defined limit of the order of unity for T = tN and 
C2 given by eq. H54() . We now plug the previous conven- 
tions and assumption into the evolution equation for the 
conditional, normalized generating function of Ci. This 
equation is formed by dividing eq. by Gi(l,r|C2), 
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which yields, in a formal way, 

LHS = RHSi + RHS2 

LHS :=Gi(Xi,T+l|C2)/Gi(l,T|C2) 



(57) 



RHSi 



1 



RHS2 := 1 



x.-i Y'^'^'^^ 1 Gi(/i;r|C2) 

N-TJ A Gi(l,r|C2) 

N-t) [ /JGi(1,T|C2) 



From this equation we get, in the following subsections, 
all results relevant to the critical behaviour of the success 
probability of the greedy algorithm. 



4.2.1 Analysis of the RHS terms 

The two contributions of the r.h.s. of the evolution equa- 
tion (|57|l have the detailed expression (for Xq — 0): 



RHSi = \ l + xie2{t)A'^ 



1 



2N{l-t) 2 



A 



O [xiA{N -Tf-^] 



7r(xi,t) + 



2N{l-t)A 
O [(iV-T)-M-2] I 



d^^'K{xi,t)+ 



(58) 



and 



1 



0{- 



xiA 



RHS2 = -cr(0^|a;iZ\-f , , . , 

^ ^ \ 2N{l-t) ^N{l-t)' 



OixfA"^) + 0[{N^ r)-2] + o {xjA^iN - T^-^] | 

where, as usual, T = tN. 

Apart from the dominant term Tf(xi,t) that cancels 
with the dominant term of the l.h.s., the first terms have 
order and 1 / (NA) (we here assume that i < 1 in the 
critical regime, which will be the case in all subsequent 
situations). Then, A^, l/N, A/N and so on are negligible 
for large N (because A vanishes, but slower than 1/{N — 
T) since the integer G2 may not vary by less than unity). 
So do the terms stemming from fluctuations of G2 around 
its typical value, A'^{N - Tf-^ and A{N - T^-^ if we 
choose S carefully (see below) . Choosing A such that A^ = 
1 / (NA) allows us to gather a maximal number of terms in 
the equation for tt. Other choices are possible but trivial, 
in that they would correspond to either the success or the 
failure regimes, but not to the critical case. From now on, 
A :=iV-i/3. 

Then, the fluctuations of G2 in the above expansion are 
of the order of A{N ~ T)^-^ = {N - T)*-4/3. Using the 
notations from eq. (p^ . C = (5-4/3. These fluctuations 
are negligible with respect to N"^/^ \i 5 < 2/3. Remember 



that the range of possible values for 5 was ]l/2, 1[; in the 
critical situation here, we may choose 5 e]l/2, 2/3[. It will 
be checked later that the third term of the l.h.s. of eq. (|26|l 
is also negligible w.r.t. the first one. Finally, 

RHSi + RHS2 = Tf{xi,t) + N-^l^ X 

1 x'^ 

^a:ri7f(xi,t) -I- {e2{t) XX + -A)w{xut) 



—(j{t)xi 



o 



{N - T)-\xi{N ~ T)*"4/3l ^59) 



4.2.2 Analysis of the LHS terms 

LHS in eq. (|57|l has to do with time evolution. The values 
of ^2 or 62 are given by the average value of G2 calculated 
in Section 13.31 62 is of the order of unity when N — > -l-oo 
if the resolution trajectory comes close to ^2 = 1 i-c. if the 
initial clauses-per-variable ratio a is close to its threshold 
value eqs. (|46I - I47|I . We zoom in around the time, say, t* 
where ^2 is closest to 1 (or equal to 1 if we are exactly on 
the borderline between the success and failure cases) and 
let 

T ^t* N + tQNA'' ^t* N + toN^^''/^. (60) 

a will be fixed later for each family of near-critical trajec- 
tories so that 1 — d2{t) is indeed of order Z\ on a finite 
interval of rescaled times to- We now assume that W and 
Gi(Xi — 1) have, when N 00 and time T is given by 
(I60|l . well defined limits, regular w.r.t. to The l.h.s. of 
eq. H57(l may be written for large N 

LHS = Gi(l,r+ l|G2)/Gi(l,T|G2) ^(xi,T-H 1) (61) 

where T + 1 ^ to + N-^A-"- = to + Af-i+'^/s. As time 
to goes on, the shape of the distribution of Gi/Z\ (en- 
coded into 7f) and the probability of success (given by 
Gi(l,r|G2)) both vary. Fix flrst xi at so that Xi = 1 
and 7f = 1, then eqs. (|61|l and (|59(l read 



RHSi + RHS2 = 1 + ^^^^_^^ iV-^/'0.,7f(l, t) + 



LHS = 1 + N-^+'^^^'dt, ln[Gi(l, to|G2)] (62) 

1 + — rN- 
2(1 -t) 

0[{N-T)-^]. (63) 

Comparing the two members, eq. (|62|l and eq. (|63|) . of 
eq. (|57jl shows that ln[Gi(l, to|G2)] is of the order of 

This assumption could presumably be demonstrated us- 
ing the same technique as for Wormald's theorem fl'I'i . If 
we introduce ex nihilo the functions tt and Gi that sat- 
isfy the equations 1)68(1 - 1169^ . we could show that the dif- 
ference between the sequences, for T from 1 to A'^, of 
the discrete quantities Gi(Xi,r|G2) and of the approxi- 
mate quantities tv[{1 - Xi)N^/^ , (T - Nt*)N-^+''^'^] Gi[l, 
to = {T - Nt*)N-^+''/'^], vanishes when +00. The rea- 

son is that the difference between two consecutive terms of 
each of the two sequences is the same up to a small quantity 
that yields a negligible difference at the final date T — N, and 
the initial conditions for both sequences are equal. 
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jSfii-a-)/^ ^ with subleading terms of the order of N^°-/'^ 
at most. Defining A:=(l — a)/3, we have in the large N 
limit 

-7V-MnGi(l,T|C2)^/i(to), (64) 

a regular function of t^. Eqs. (|62|l and l|59(l with xi = 
yield 



1 



— aj;i7r(0,io)- 



(65) 



2(1 -t*) 

As tF is the generating function of c ~ Ci / A (conditioned 
to success of the algorithm) , 

cito) := -d,MO,to) (66) 

is the conditional average of the rescaled number c of unit- 
clauses. 

For general xi now, the previous assumptions lead 
from the expression (|61(l of the l.h.s. of eq. (|57|l to 

LHS - 7f{xi,to) + N-^+'^'^dtM^iM 

-N-''/'w{xi,to)dt,fi(to) + 0{N-'+'^^'). (67) 

4.3 Critical evolution equations 

Comparing the two sides of the evolution equation, eq. 1)591) 
and eq. (|67|l . we are left with two situations. 

— If a — 1: A = and it is convenient to use the non- 
normalized (non-conditional) generating function 

7r(xi,io) := exp[-^(to)]7f(xi,to). 

The total probability here, 7r(0,to)7 is not 1 as for tt 
but the success probability of the greedy algorithm, tt 
satisfies the following PDE: 



dtoTr{xi,to) = 



1 



2(1 - t*) 
e2{to)xi 



dx^Tr{xi,to) + 



(68) 



7r(a:i, io) - cr(to)xi 



If a < 1: the probability of success has the scaling 
relationship Pgucccss oc exp[— ©(A^^^"")/^)]. The time- 
derivative term 9to7f(xi,io) is negligible w.r.t. other 
terms, and tt satisfies the ODE: 



[dxj^Tr{xi,to) + c(to)'n-{xi,tQ)] + 

Tr{xi,to) - a{to)xi . (69) 



2(1 -t*) 



xl 



The third possibility, a > 1, leads to inconsistencies and 
has to be rejected 

We would have either subdominant terms of the order of 
j^-i+a/3 ^ larger than the dominant term (of the order of 1) 
'n{x\,to) if a > 3, or the two equations 9t(,7f(a;i, to) = and an 
ODE for 7f at fixed to but with coefficients e2(to) and o"(io). 
In the latter case, since tt would be constant with time, 62 
and thus 6.2 should also be constant, which is impossible in the 
context of our algorithm (see Eq. 



In the previous section, we classified the critical res- 
olution trajectories into two families: those of the 2-SAT 
family {p < 2/5) start from the d2 — I fine but are secant 
to it, and those of the 3-SAT family {p > 2/5) do not start 
on this line but get tangent to it ('parabola situation'). As 
we will see in the next sections, these two families corre- 
spond, respectively, to values of the exponent a equal to 
1 and 1/2, making successively eq. and (|^ relevant. 



5 The 2-SAT class (power law class) 

5.1 Equations for 2-SAT and its family 

When p < 2/5, the critical resolution trajectory starts on 
the d2 = 1 fine and is secant to it (at time t* = 0) with 
slope 

The threshold value of a is aR(p) = 1/(1 —p) (eq.^BJ. On 
this resolution trajectory, l — d2{t — t*) cx t — t*. Therefore, 
this resolution trajectory is at distance A — A"^/'^ of the 
d2 = 1 line as long as t — i* is of order A^: the exponent 
a equals 1 here and the relevant equation is eq. (|58|) . 

The critical regime is realized when a is close to the 
value an (p) = — p). The relevant scaling is 



aR(p)(l + e„A 



-l/3\ 



(71) 



with finite since, if a is less than aji{p) by more than 
A = A^^/"^, at the initial date t = 0, d2{0) is already out 
of the critical region 1 — A (remember that d2 (t) decreases 
with time if p < 2/5 as can be seen in Fig.O. Conversely, 
if a is above aB,{p) by a distance much greater than A 
at time t = 0, 1 — ^2(0) is an order of magnitude higher 
than the critical distance A and an infinite duration, on 
the scale of t2 in eq. (|60)l with a = 1, is needed until this 
critical distance is reached. Notice that the critical window 
here coincides with the critical window of the static phase 
transition for 2-SAT jl7| . remember, deals with the mere 
existence of satisfying rather than with the ability of some 
algorithm to find them) 

Finally, t* is such that d2{t) = 1, therefore 

t* = e„//3(p)A-i/3 + o(Ar-2/3) 
from eq. H29|l . and the relevant scaling for time is 

T^[t2 + ea//3(p)] A2/3 (72) 

according to eq. (|6()|l . where we replaced the notation tp 
with t2 to emphasize that this scaling is proper to the 
2-SAT family. 

We have to solve eq. (|68|l with this choice of scales and 
with proper initial and boundary conditions. Define 



Pno-coiitr(— > h) 7r(a;i 



0,t2 



(73) 



This is the probability that the algorithm detects no con- 
tradiction from T = up to the (rescaled) time t2. We 
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shall send t2 +00 at the end. In the case of the greedy 
UC algorithm, l/[2(l-t*)] = l/2+0{N-^/^) and 62(^2) = 
f3{p)t2 + 0[N-'^l^), so that eq. ^ reads: 

dt^TT[xi,t2) = ^9a;i7r(xi,t2) + 



(3{p)t 2x1 



Tr{xi,t2) ~ <j{t2)xi. (74) 



In practice, we find it easier to perform an inverse 
Laplace transform of eq. H68(l before solving it; this amounts 
to work with probability density functions (PDFs) rather 
than with generating functions. In particular, the diffi- 
culty of computing a that appears in eq. I|68|l is turned 
into a boundary condition on the PDF that is easier to 
deal with. 

If we plug the critical scaling of Xi, eq. (|53|l . into the 
definition of the generating function Gi {Xi ) of Ci : 

+ OC 

+ 00 
Ci=0 

and (in an heuristic way) change the discrete sum on Ci 
into an integral on c :— CiN^^^"^, letting N go to -l-oo: 



7r(xi) 



"pic) dc 



(76) 



where p{c) is the probability density function (PDF) of c, 
we see that tt{xi) is the Laplace transform of p{c) with re- 
spect to c. Here we have dropped the time dependence, but 
p(c, t2) is actually a function of c and ^2 and the Laplace 
transform is taken at fixed time. 

In terms of p{c, 12), eq. 1)741) translates into 

dt^p{c, t2) = ]^dlp{c, 12) + I3{p) t2 dcp{c, t2) - p(c, 12) . 

(77) 

This inverse Laplace transform can be performed only if 
the limit when xi +00 of the r.h.s. of eq. H74|l is zero. 
Writing from (|76(l the asymptotic expansion for 7r(a;i,t2) 
in terms of the density of clauses and its derivatives at the 
origin c = 0, 

7r(a;i, i2) = p(0, 12) / xi + d,p{Q, 12) /xl + o{l/xl) (78) 

and plugging it into eq. (|74|l . we find that a{t2) = 1/2 p{Q, t2 
and p{c,t2) satisfies the boundary condition: 



r)i/3 > 0. At the initial time step T = 0, i.e. i2init = 
—€a/P(p), there are no 1-clauses, so that 7r(a;i, i2init) = 1 
for all xi and /9(c, i2init) is a Dirac 5 distribution cen- 
tered on c = 0: the diffusing particles all sit on the c = 
point. Then, they start diffusing (second-derivative term 
in Eq. (|77jl ) because new 1-clauses are produced randomly 
from 2-clauses when variables are assigned by the algo- 
rithm. This diffusion is biased: the drift term dcp{c, ^2) 
comes from the tendency of the algorithm to make 1- 
clauses disappear (to satisfy them). A picture of this pro- 
cess may be found in the upper-right inset of Figure |H1 
where the PDF p is shown after normalization. The to- 
tal number of particles is not conserved: the absorption 
term cp{c,t2) results from the stopping of some runs of 
the algorithm, those where a contradiction (a 0-clause) is 
detected. The probability that no contradiction has been 
encountered till time t2, Pno-contr(^ ^2) = 7r(0, i2) = 
J^ J^ p(c, t2)dc, is a decreasing function of t2, smaller than 
unity. 



5.2 Results for 2-SAT and its family 

Unfortunately, we were not able to solve analytically eq. H77|) . 
Our study relies on an asymptotic expansion of the solu- 
tion of this equation for large times <2 and on a numerical 
resolution procedure to get results at finite times ^2- This 
numerical resolution was in turn helped with an asymp- 
totic expansion at small times ^2- 

Details about the large times ^2 expansion may be 
found in Appendix ICl In short, we find that the proba- 
bility that the greedy algorithm does not stop till time t2 
decays algebraically at large times. 



^dep{0,t2)+m t2 p(0,t2) = 0. 



(79) 



Conversely, one verifies that eq. H77|l supplemented with 
the boundary condition eq. (|79() leads by direct Laplace 
transform to eq. (|74|l where <j{t2) is replaced with p(0, t2)/2. 

Eq. (|77|l . supplemented with eq. (|79|l . is a reaction- 
diffusion equation on the semi-infinite axis of c = Ci/ (A^— 



Pno-conti(— ^2) 



p{c,t2)dc CX t. 



(80) 



The leading order of the probability of success at the final 

time step T = N can be guessed by replacing t2 with 
Ari/3. 

^'succoss[aR(p),p] CX A^-W ^ (81) 
an intermediate behaviour between the success (finite 



^Luccoss) and failure 



In Psuccess oc N) situations defined 



in Section 

The proportionality factor in eq. (P?T|l can be calculated 
through a numerical resolution of eq. H77|l for finite values 

of t2. 



5.2.1 Numerical resolution of eq. 1)77(1 at finite times t2 

We have solved the reaction-diffusion-like eq. ()77)) thanks 
to a standard numerical resolution scheme (the Crank- 
Nicholson method) after some preliminary steps. First we 
discretized both time and 'space' (the semi-infinite axis 
of c) . It is convenient to consider finite-support functions, 
and we have tried the changes of variables b = l/(c -|- 1) 
and b = exp(— c); the latter turned out to be better. The 
range ]0, 1] for b was discretized into M points. The Crank- 
Nicholson method allows us to take a time step 1/M for 
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the numerical resolution (quite efficient as compared to 
the time step l/N"^ for Euler's method), provided that 
the Courant condition is respected. With eq. H77() this 
is not the case, since the coefficient of the drift term, 
(3{p)t2, is not bounded with growing ^2- We actually con- 
sider p{c,t2) — exp{—(3ct2 — /3'^t2/6)p{c,t2) rather than 
p{c,t2) so that the Courant condition is satisfied. What 
we have to solve now is 

^2 ^ 



Hb)p{hM) (82) 



with Xq = and for < 6 < 1, and with the boundary 
condition 



afcp(l,i2)-/3(p)i2p(l,t2) -0 



(83) 



At initial time i2init = -<^a/P{p), p(c, t2init) = <5(c). 
The most relevant term in eq. H77|l is the diffusion term, 
and we expect c to grow like v^i2^-"?2h^ typically 
Therefore, we start our numerical resolution at time i2init+ 
Af~^^^ so that a finite number of discretization points (in- 
stead of just the point on the 6=1 boundary) share the 
support of p. The initial condition is given by a short-time 
series expansion of the solution of eq. H77|l. Details about 
this expansion are found in Appendix IdI 

We first tested our program by studying the linear 
equation eq. ifTTji . or more precisely eq. (P?^ with Xq set 
to 1. In this case, the distribution p is normalized to unity. 
We observed that this conservation rule is fulfilled by our 
numerical resolution scheme up to a small deviation of 
order 1/Af, diverging with the simulation time t2- There- 
fore, we must be careful in our choice for the final time 
of the simulation. Moreover, when we plotted the condi- 
tional average 0(^2), we found a very good agreement with 
the analytical expansions at small and large times t2 ■ This 
agreement was also observed for the non-linear equation 
eq. H82(l — see Figure 0] Therefore, we think that our nu- 
merical results are quite reliable, at least on finite time 
ranges. 



5.2.2 Probability of success in the critical time regime and 
the scaling function H 

As the numerical precision on 0(^2) is greater than on the 
total probability we have calculated the probability of 
success through the numerical results for c. The proba- 
bility 7f(0,t2) is indeed related to the values of 0(^2) by 
integrating eq. H77|l over c from to +00, see eq. (|65|l : 



dt2 In Pno- contr (-^ ^2 



-c(t2)/2 



(84) 



where we have used the boundary condition eq. Il79|l . In 
practice, we integrated c numerically from i2iiiit (which 



See also eq. (|Dj 

For instance, in the case of Xq = 1, with JV = 100 and 
at t2 — 6.5, the relative error on the total probability equals 
6.8%, whereas it is only 2.5% on c. 
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Fig. 4. Numerical data for c{t2) in the case of critical 2- 
SAT {f3 — 1 and = 0), obtained by numerically solving 
eq. 1771 . Comparisons with the analytical small- and large- 
times asymptotic expansions are shown. 



depends on p and e^) to ^2 ~ 5, and used our large-t2 
expansion (see Eq. (|5n|l and AppendixQ for t2 > 5, which 
yields 



- lnpno-contr(^ h'j^cP) = 



\nt2_ 

4/3(p) 



-H[eaj3{p)]+0{t, 



(85) 

with the following values for H at criticality (cq = 0): 
0.24371 ± 10~5 for p = 0, 0.24752 ± 10"^ for p = 1/7, 
0.20157 ± 10-^ for p = 1/4, -0.208 ± 10"^ for p = 1/3. 
These values are extrapolations to A/" = oo of numeri- 
cal results for a number of discretization points Af up to 
1600. We checked that changing the end time of numerical 
integration from « 5 to « 10 did not change this extrap- 
olation (although it notably affects the numerical integral 
for values of A/'< 1000). 

The behaviour of pno-contr in the critical time range is 
illustrated in the inset of Figure in the case of 2-SAT. 
For N = +00 (continuous line), eq. yields the large- 
t2 asymptote while data for small t2 come from numerical 
results for eq. H77|l . This compares well with results for 
finite sizes N from 25 to 1000 (points), even though the 
finite-size effects in A^~^/^ are large; for fixed t2, finite-size 
data converge to the N — +oo result but, on a series of 
data for fixed there is a cross-over from the time regime 
T = t2N^^^ to the time regime T — tN (where the correct 
scaling is illustrated in the main plot). The finite- A^ data 
were computed by direct solution of the evolution equa- 
tion H34(l for the generating function Gi of Ci, thanks 
to the technique exposed in Appendix El They have no 
Monte-Carlo error but don't take into account the Gaus- 
sian fluctuations of C2. 
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H may be viewed as a scaling function of the parameter 
Ca for the probability not to find a contradiction in the 
time scale T = 0{N^^^). We computed along the same 
lines several values of H for various 7^ at fixed p — 0. 
Results are shown in Figure [S] 



30 



20 



H 
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2-SAT- 



-15 



-10 



-5 

e 



Fig. 5. Numerical results (dots) for the scaling function H of 
the probability pno-contr in the critical time scale as a function 
of the relative distance to the critical constraint-per- variable 
ratio, in the case of 2-SAT (p = 0, /3 = 1). The error bars are 
smaller than the symbol's size. The dotted line is a guide for 
the eye. Left inset: y := exp{—4:H) is plotted VS. €a. to show 
that, when —00, H ^ — ln(— eQ)/4. The straight line has 

equation y — — Eq. Right inset: H^^^ is plotted vs. ta to show 
that, when +00, H oc e^. The straight line is a tentative 

linear fit. 



Let us check heuristically that the success and failure 
cases are recovered from the critical results when tends 
to —(X) and +00 respectively; this amounts to precise the 
large-ec behaviour of the scaling function H. fixes the 
initial date t2init = —^a/P{p) in the time scale of t2 and 
this in turn influences the value of H . 

For a < aYi{p) i.e. Ca — —SN^^^ for some fixed S > 0, 
c{t2) reaches very quickly its asymptotic regime for large 
t2- c{t2) ~ l/(2/3t2), and we obtain 

- lnp„o-contr(i2init = SN^'"" tN^^ ~ 



'^^^ = Ts(lni-ln<5). (86) 



5/0m/^ 4/3i2 4/3 



Pno-contr is finite, as expected in the success case. This 
computation shows that the scaling function H should 
behave like — ln(— eQ)/(4/3) for large negative e^; this is 
confirmed numerically in the left inset of Figure 



Above the critical threshold, for ea = -\-SN^/^ with 
(5 > 0, c is driven away from at speed « S/jSN^^^ from 
time t2 = -5/(3N'^/^ to time (see eq. (|ZZ|)), hence c w 
^2/^2^2/3 foj, duration « Hence, 



- Inpn 



r(t2init = -5N^'^ ^ t2 



1 



12/3 1-^+^^ 



(87) 



Pno-contr vanishcs exponentially with N as expected. H 
should behave like for large positive ea', this is con- 
firmed numerically in the right inset of Figure |S1 



5.2.3 Matching together critical and non-critical time scales 
— final result for Psucccss 

Equation H85|l may be written as, setting t2 + ea/P = 



lnP,uccess(T = 0^tiV) 



^ ilniV-t-lni 



4/3(p) V3 



H[e^,P{p)]+o(^-Jj^^ +N-'/'Q{tN'/',N) 
where Q{t2,N) is bounded when N +00 with fixed 

i 14 
E2 

The behaviour of Pno-contr for times T of the order of 
N is illustrated in the main part of Figure (for 2-SAT), 
where — Inpno-contr — In iV is plotted as a function of t. 
The dashed line is the N = +00 result i(lni-t) (0, 1) 
(see below for the expression of Q). Data for finite-sizes 
N (points) compare well with this result if T ^ N"^/^; 
otherwise there are strong finite-size effects and the critical 
time regime results are relevant (see inset). 

Outside the critical regime, the probability that no 
contradiction is found can be calculated along the lines 
of Section 14.11 In the iV —> -fcx3 limit with fixed ratio 
T/N , the probability that no 0-clause is found between 
times < t < 1 and 1 satisfies 



- Inp, 



-contr(r = tN^N) = 

dr d2(T)2 



4(1- 
Int 



d2{r) 



+ 0{N 



+ / dT/(r)+o(iV-i/3^ 



(89) 



since the reasoning that led to eqs. 1)42(1 and H44|) is still 
vahd here: if tjnit > and a = aB.{p){l + £aN~^/^) accord- 
ing to eq. (|71|l . we know that d2{t) is bounded away from 
1 when N — > +00. Notice that the subdominant term in 
eq. ((89(1 is not of order 1/A^ like in eq. ((42(1 because we 
approximate a with a^ . The expression for function /, 



fit) 



ipt 



9p2 



8(1 -p) 4(2-5p)(3pi-l-2-5]5) 



(90) 



Q is like the sum of the asymptotic expansion of 
— InPsuccess in powers of N~^^^ without the leading term. Our 
aim is to precise how Q{t2,N) behaves when both t2 and A*' 
go to +00. 
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(without the In factor if Xq — 1) and 



-0.3 



-0.6 



t = T/N 



Fig. 6. Illustration of the two relevant time regimes for the 
probability Pno-contr that no contradiction is found between 
T — and some step T. Points are data for the exact solution 
of eq. (I34II in the case of 2-SAT at criticality (q — 1) for sizes 
= 25, 37, 51, 101, 151, 201, 251, 501, 751 and 1001 (increas- 
ingly close to the N = -f-oo asymptotic lines). Main plot: 
— Inpno-contr — ^ lu A is plotted vs. t = T/N. This quantity 
has a well-defined continuous limit when N +oo with fixed 
t > (dashed line); see text for the computation of this limit. 
Because of the cross-over from the critical time-regime, con- 
vergence is non-uniform; finite-size effects are huge except if 
T > N^^^. Inset: -Inpno-contr is plotted vs. t2 = T/N^'^; it 
has a well-defined continuous limit when N +oo with fixed 
t2 > (solid line). Convergence is non-uniform; finite-size ef- 
fects are huge in the cross-over regime T ^ N^^^ . Data for 
the solid line are from numerical solution of eq. H77|l and the 
asymptotic expression (1851 . 

is found from eqs. (|37|l and (|29|) . 

Comparing eq. and eq. yields 

N-^/^QitN^/^, N) ^ F(i) + O [n-^I^^ + O {r'^N-^) 

(91) 

where is a primitive of /. Using this expression for Q in 
eq. lIHSJ), setting i = N^^/^{t2 + ea/P) and letting t2 go to 
shows that F{0) — 0. Finally, the probability of success 



— In P 

^ success 

with 



Fil) 



InN 
12/3(p) 

ln/3(p) 3p 



(92) 

4 - p 



(93) 



Pip) 8(1 -p) 16(1 -p)- 

The corrections due to the fluctuations of C2, tem- 
porarily left aside, are of the order of, from eq. (|26|l . 



o 



N' 



3/2-5 



exp 



where S has to be in the range ]l/2, 2/3[. They are negli- 
gible w.r.t. all other terms of eq. (|92|l . Notice that i5 has 
opposite effects on the two corrections, as was anticipated 
in the discussion following eq. 

Let us give precise values for some special cases, to il- 
lustrate the predictive power of our computation, although 
we have no analytical formula for H. For the greedy algo- 
rithm (Xq — 0) at the critical point (aR,(p)), — InPgucccss 
equals, up to O (N^^/^), 

In 7V/12 + 0.24371 ± 10"^ - 1 /4 (94) 

In 7V/9 + 0.24752 ±10"^ -9/32 + ln(3/4)/12 (95) 

5/16-ln(2)/4 (96) 

11/32- 31n(2)/2 (97) 



In 7V/6 + 0.20157 ± 10"^ 
lniV/3- 0.208 ± 10~3 



for 2-SAT, 2+1/7-SAT, 2-fl/4-SAT and 2+1/3-SAT re- 
spectively. 

Figure d compares these results with empirical suc- 
cess probabilities, obtained by running the greedy UC al- 
gorithm on a large number (4.10^ to 3.10^) of instances 
of random 2+p-SAT at critical initial clauses-per-variable 
threshold a^iip) with sizes up to A^ = 10^. Interpretation 
of the data could be difficult because finite-size effects are 
strong. But if we take into account the finite corrections 
to the In A^ terms in eqs. (|M | - (|??7|l . a very good agreement 
is found. 

5.2.4 The critical distribution of Ci 

In the critical time regime (T of the order of N^^^), the 
PDF of c = Ci/A^i/3 > is the solution p of eq. (EHl- As 
a special case, the probability that no 1-clause is present 
IS tr ~ Af-i/V(0)/2. Conver gence to this distribution, for 
c > on one hand and for c = on the other hand, is 
observed numerically — see Figure |H1 The convergence is 
not uniform in the neighbourhood of c — 0, which is ex- 
pected since the distribution p is singular in c = 0. There 
is rather a cross-over from the regime where Ci <C N^^^ 
to the regime where Ci is of the order of N^^^. Eq. H39|l 
yields, going to the c?2 ^ 1 limit (well-defined if Xi > 0), 



1-Xi 



1 - Xi exp(l - Xi 



-MO) 



Hence (for large N with fixed Xi) 
G,(X,) = A.-V3K0) 1-X, 



(98) 



+ o(a^-i/3) . (99) 



2 1 - ATiei-^i 

The probabilities that Ci takes the values 0, 1, 2, ... are 
given by the coefficients of the Taylor expansion, in Xi — 
0, of Gi above. It is observed that these probabilities con- 
verge very quickly to N~^^^ p{0), which coincides with the 
c ^ limit of the distribution p of c = Ci/N^^^. These 
probabilities are plotted in the lower-left inset of Figure|Hl 
together with numerical data for finite sizes. A good agree- 
ment is found. 
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Fig. 7. Comparison of eqs. Il94ll - (I97II (lines) with empirical 
estimates (symbols) for the probability of success of the greedy 
UC algorithm on instances of the random 2 + p-SAT problem 
at the critical initial constraint-per-variable ratio aR,(p) (i.e. 
ta = 0) for p=0, p = l/7, p=l/4 and p = 1/3, and data for 
the probability of success of the greedy HL algorithm I15| on 
instances of critical 2-SAT. The error bars are smaller than the 
symbols' sizes. Data for — In Pauccess / In A'^ are plotted against 
1/lnA, and the straight lines come from our asymptotic an- 
alytical results, eqs. I94H97II (ignoring the ©(A-^/^) terms), 
except for HL where it is a tentative extrapolation to A = -|-oo. 

5.2.5 Universality 



Fig. 8. The normalized PDF p of c = Ci/A^^^ for 2-SAT at 
the critical initial clauses-per-variable ratio a — 1 (e^ — 0) and 
at t2 = 0.3. Long-dashed line: limit result for p at A = -|-oo. 
Dots: numerical results from 8.10^ runs of the UC algorithm 
for A — 10^ (A; the error bars are smaller than the symbol's 
size) and results from the direct solution of eq. (I34II for A — 
501 (■) — see appendix lEl Lower-left inset: probability P 
(expressed in N~^^^ p{0) /2 units) that Ci takes finite values at 
the A — +00 limit (•). This shows how the discontinuity, at the 
scale of c, between N^^'^P{Ci = 0) = a and limc->o+ p(c) = 2a 
is resolved at the scale of Ci. Numerical data (same as for the 
main plot) converge to P when A +oo. Upper-right inset: 
Normalized distributions p for A — +<x at times t2 = 0.1 (solid 
line), t2 = 0.3 (long dashes) and t2 = 1.5 (short dashes). 



For a given p < 2/5, all algorithms that use the UP rule 
fall into the same universality class (which depends on p) . 
They share the result eq. H92|) with common H (but F{1) 
is a non- universal correction) , and the critical distribution 
of Ci studied in Section 15.2.41 

The reason is two-fold: first, the analysis done so far in 
Section is still valid for another heuristic than R, run on 
random instances of the 2 -|-p-SAT problem, provided the 
critical trajectory starts on the d2 = 1 line and is secant 
to it with some slope (3 > 0. Second, the value of P at 
criticality is universal and depends on p only, because, at 
criticality, the heuristic is almost never used and UP alone 
fixes the slope: even if the resolution trajectories of several 
heuristics may be quite different in general (compare e.g. 
Eqs. (|2nil for R and (PU for GUC), in the critical regime, 
the probability that Ci ~ and the heuristic is used is of 
the order of N~^^^ only. Most of the time, the UP rule is 
used, and the resulting evolution of C2 and C3 is common 
to all algorithms: the slope of the critical trajectory is 
P{p) = (2 - 5p)/[2(l - p)] as for UC. We verified this by 
direct computation from eq. for R, eq. (pTT]! for GUC 
and the corresponding equations of reference J2| for HL 
and CL. 



Figure [71 shows the agreement of empirical data for the 
HL heuristic, used on random 2-SAT instances, with the 
scaling of Psuccoss that we derived for the R heuristic. 

6 The 3-SAT class (stretched exponential 
class) 

6.1 Equations and results for 3-SAT and its class 

We now address the case p > 2/5. Here, the critical res- 
olution trajectory starts below the d2 — I line and gets 
tangent to it, at point {p = 2/5, a = 5/3) at a finite time, 
t* €]0, 1[. From eq. (|29(l . d2{t) is locally a parabola around 
t* : I — d2{t ~ t*) (X {t — t*Y. The critical resolution tra- 
jectory is at distance A — N^^^^ of the d2 — 1 line as 
long as t — t* is of order Z\^/^: the exponent a equals 1/2 
here and the relevant equation is eq. H69(l . not eq. (|68|l as 
for 2-SAT. The computation is easier here (at least for the 
leading order) since we have an ordinary differential equa- 
tion (the time enters into play only as a parameter of the 
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coefficients of this ODE) rather than a partial derivatives 
equation. The relevant scaling for time is 



(100) 



according to eq. l|Hn| where we replaced the notation to 
with to emphasize that this scaling is proper to the 
3-SAT class. 

As for 2-SAT, the critical regime extends to a non- 
empty range of values of a. This critical window is the 
same: we set 



= aR(p)(l-f e,7V-i/3) 



(101) 



with finite Ec,. Indeed, if a is less than a^{p) by more 
than A — N~^/^, because c?2(^) is increasing proportion- 
ally with a (see Eqs. (|^ and (EDl), the resolution trajec- 
tory will be out of the critical region in particular at the 
time t* where d2{t) is maximal since 1 — d2(t) ^ A, and 
therefore at all times. Conversely, if a is above aR(p) by a 
distance much greater than A, at time t* l~d2 is an order 
of magnitude higher than the critical distance A, which 
implies that the resolution trajectory would stay for an in- 
finite duration, on the scale of in eq. H()U|) with a — 1/2, 
above the d2 = 1 hnc - this would yield numerous contra- 
dictions (0-clauses) and let the probability of success be 
exponentially small. 



6.1.1 Results for the critical time regime 

We now have to solve eq. (|69|l . As for the 2-SAT family, 
we prefer to do computations on the (here normalized, or 
conditioned to success of the greedy algorithm) PDF p of 
the stochastic variable Ci/N^^^ rather than on its gener- 
ating function 7f . Performing an inverse Laplace transform 
on eq. yields 

= \dlp{c, h)+e2{h)dc'p{c, ^2)+ j_ [c(t3)-c]p(c, ^3) 



with the boundary condition 



(102) 



62(^3)^(0, ia) = 0. 



The parameters in eqs. (|102f) and H1U3|I are t* — 5/6 — 
l/(3p) and 62(^3) = 36t2p2/(p + 2f - e„. 

Here, the initial condition is mostly irrelevant: the ini- 
tial step of the algorithm, T = Q or t* —00, is far out 
of the critical time region (finite t*). When the resolu- 
tion trajectory enters this region, the distribution of Ci 
has already equilibrated to its critical value and is only 
subject to 'adiabatic' changes during the crossing of the 
critical region (Eq. (|102|l has no time derivative). Solving 
the ODE H1U2|) brings out the explicit critical distribution 
of 1-clauses. 

Let u(c,t3) := p(c, is) exp[e2(t3)c], k{p) = [6p/(2-|- 
p)]~^^^ and 



c-cih) + ^^e2it3f 
bp 



/Hp) 



(104) 



Eq. 11U2|I is recast into an equation that admits Airy's Ai 
and Bi functions as linearly independent solutions: 



d^u{z, ts) — zu{z, t^) — 0. 



(105) 



See references in for studies of eq. (|105|l in the con- 
text of (semi-classical) quantum mechanics or |24) in the 
context of Brownian motion (similar to our situation). 
Since u has to vanish for large z (because /3(c, ts) — > 
for large c) whereas Bi(z) is not bounded for large z, 
u{z,t3) = A{t3)Ai(z) where A is a normalization coeffi- 
cient. The boundary condition eq. (|103() reads 



Ai'(zo)/Ai(zo) = -k{p)e2{h) 



(106) 



where zq is expressed from eq. ()l()4ll with c = 0. Let A 
be the reciprocal function of Ai'/Ai. Inverting eq. Hl()6|l 
yields an expression for 0(^3): 



c{h) ^ k{p)^e2it3 f 

and 
P(c, ^3 



k{p)A[~k{p)e2{ts)] (107) 



A(i3)e-^^(*^)=Ai {k{p)c + A [-k{p)e2{h)]} . 

(108) 

We did not compute explicitly the normalization constant 
^4(^3). The critical distribution p is plotted in Figure |51 
for 3-SAT {p — 1) and several values of 62, to show the 
influence of the drift on its shape. The agreement with 
numerics is good; the same phenomenon in c = as for the 
2-SAT family is observed: at iV = 00, p is singular, and for 
finite N there is a cross-over from the regime Ci <C N^^^ 
to the regime Ci = cN^^^ (see Sect. 15.2. 4l l. 



The probability that the greedy algorithm doesn't find 
a contradiction in the critical regime from time ig^'' up to 

time t^"^ satisfies, according to eq. ()64f) and the discussion 
preceding it, 

Inpno— contr 



.(1) 



,(2) 



(4^))-^(4^))]+0(iV-V6) (109) 

where, from eq. and the initial condition pno-contr(r = 
0) = 1, dt^p = k{p)~^c{t3)/2 and pits) ^ as ^3 — > —00. 

(103) 

In the interesting situation where ^3^^ < < t^^\ using the 



y variable such that Ai' (y) / Ai{y) :— 
than ^3, 



-fc(p) 62(^3) rather 




^[-fe(p)e2(tf )]' 

Ai'(y)2 



^k{p)e^-Ai'iy)/Aiiy) 
Define (the integral is finite) 

dy 



Ai(y)2 



y 



(110) 



+00 



A{x) Ai'{y)/Ai{y) 



Ai(y)2 



(111) 
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Fig. 9. Critical distributions p for c = Ci/N^^^ in the case of 
3-SAT with drifts 62 = 0.5 (solid line, 0)i C2 = (dashed line, 
■ ) and 62 = —0.7 (dotted line, A). Points are from numerical 
estimates with size N = 10*. 



(2) 

For large positive ig 

4^^ 



(and similarly for large negative 



M (4'^) - m(o) = ^/mmp)^ 



k{pf 



4t 



(2) 



o 



(112) 

where we have used a large-ia expansion of eq. (|107|l . From 
eq. UlUUfl . is of order N'^/^ at most, thus eq. H112|l allows 
one to express in terms of only, up to corrections 
of the order of N^^/^ . Anticipating that the non-critical 
time regime, like the success case in Section 14.11 brings 
contributions to InPgucccss of order C(l) in N , the total 



probability of success (at time T 
algorithm reads 



N) of the greedy UC 



- lnPsuccoss[(l + ea)ct'R{p),p] = 

iVi/62v/fc(^^>[fc(p)ea] + 0{1) (113) 

where the function ^ is closely related to the univer- 
sal function introduced in reference It may easily be 
computed with a mathematical software and is plotted in 
Figure Uni 

Let us heuristically check that the success and failure 
cases are recovered from the critical results when tends 
to —00 and -f 00 respectively. Using the asymptotic expan- 
sions of Ai and Ai' (25i.26j , the asymptotic behaviour of 'P 

To lighten notations here, we have rescaled the argument x 
and the value of by constant coefficients w.r.t. reference 0. 
Moreover, the new $ is more universal: it is exactly the scaling 
function at the tricritical point p — 2/5, a — 5/3 for all heuris- 
tics (both r% and equal one in this case) — see section E21 



is found: 



<P{x) 



8y 



when X 



^5/2 
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^iy/x) when x +co 



where a ~ —2.338107410 is the greatest zero of Ai on the 
real axis. Taking now Ca — iV^/^e shows that — In Psucccss ~ 
7r/(4^/^) for e < 0, in agreement with eq. (g^, and 
— InPsucccss ~ 4eiV/15, in agreement with the expected 
failure behaviour. 



6.1.2 Matching critical and non-critical time scales 
result for Psucccss 



final 



Here we use a heuristic reasoning, based on what we learned 
from the study of the 2-SAT family. On the one hand, the 
results from the previous paragraph show that the prob- 
ability no to find a contradiction between times t*N and 
T ~ t*N + t^N^/^ equals (with an obvious convention for 
negative t^) 

- lnp„o-contr(T = t*N ^ t*N + <3iV5/6) = 

N^/'^W^)- ^lm+R{U,N) (114) 

where the function R{t^,N) (assumed to be regular) is 
bounded when N — s- -foo with fixed ^3. Setting ^3 = 
{t-t*)N^/^ and using cq. (|112(l . 



- lnpno-co„tr(r = t*N ^ tN) = N^'^ ^/k{p)^k{p)e^] 
k{pf 



^— + {n-^'^^ + R[{t - t*)N^'^,N]. (115) 



4 t- 



On the other hand, out of the critical time regime, 
we may modify eq. (|89|l (with the expression of d2{t) for 
a = 5/3) to compute the lost of success probability in the 
large N limit between given times on the scale of T/N . 
For t <t* and t > t* respectively. 



lnp„o-contr(7' = ^ tN) 



k{pY 



1 



1 



t - t* 



iln(^)+^*../(.)+0(iV-V3) 



(116) 



- lnpno-contr(r tN 

k{pf ( 1 



4 



4 VI 

1 - r 



.N) 
1 

t-t* 
1 



where / is the function 

3p 9p2 



2(p-H2) (p + 2)2 



(t-t*) 



(117) 



(118) 



Eqs. H116I - [TT7|) share with eq. H115|) a divergence in 
1/ (t — t* ) , but they also have a logarithmic divergence 
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that does not appear in eq. H115|l . We speculate that, if 
we pushed the asymptotic expansion for large N that led 
to eq. (|114|l one step further, we would find 



Rit3,N)^g{t3) + Sit3,N) 



(119) 



with g^ts) ^ ln(|t3|)/4 when ^3 — > ±00 and with S(t3,N) 
regular and bounded in the two limits, first N — s- +00 
with fixed ^3, then ^3 — > ±00. At this new order, the time- 
derivative term 9tQ7f(a;i, to) that was canceled to write 
eq. H69(l becomes relevant. It yields a correction to the 
probability of success that originates physically from the 
slow, 'secular', evolution of the shape of the probability 
distribution p of c — Ci/N^^'^: the solution of eq. is 
the distribution of c in a true stationary state, but here we 
have only a quasi-stationary situation (c is slowly driven) 
and the actual distribution is always delayed w.r.t. the 
perfectly equilibrated solution of eq. (|69|l . 

With the assumption l|119|l . eq. (|115|l reads, for t^t* , 



lnpno-contr(r = t* N - 



> tN) 



k{py 



4 t~t* 



±iln|i-r|T5(0) + o(A^-^/') + 
S[{t-t*)N^^^,N] (120) 
and comparing eq. (|12()|l with eqs. (Illfil - ITTTjl yields 

where and are two primitives of /. Replacing ex- 
pressions (ESI and (fnT|l with t = t*+t3N-^/^ in eq. (|114|l 
and using the regularity of 5 in ^3 = shows that 



/(r)dT 



(122) 



Finally, adding eq. (|120|l for the two cases t ^t* after 
making the substitution eq. H122() yields the total proba- 
bility of success of the greedy algorithm UC=UP+R for 
p > 2/5: 



InR 



success 



with 



E{p) = 



N^'^2^/k(p)^[k{p)t 



3p 



(123) 



3p(p-4) 

2(p + 2)2 2(5p-2) 



4 



6p 



bp -2 



- 1 



(124) 

While the N^^^ divergences of the two cases of eq. (I12U|I 
add up, the InA'^ divergences cancel out. Similarly, if we 
sett = t*-T in eq. I|116|l and t ^ t* + t in eq. H117(l . add 
the results and let r ^ 0, the In |i — t*| divergences can- 
cel out. This seems reasonable since the slow adaptation 
of the shape of p is symmetric w.r.t. the time t*: before 
t* , the driving term in eq. H69|l pushes c away from and 
the equilibration delay of the distribution of c makes the 



actual c smaller than the perfectly equilibrated c. Hence 
the IniV term in eq. H12U|) for t < t* has a negative con- 
tribution to the probability of finding two contradictory 
1-clauses. Conversely, after t*, the driving pulls c towards 
back. The delay of c makes it larger than what the per- 
fectly equilibrated c would be. This yields a positive In N 
correction in eq. H120|) . The balance of the two slow adap- 
tations is null for symmetry reasons. 

The corrections due to the fluctuations of C2, tem- 
porarily left aside, have order, after eq. H26|l . 

C)[7V'5-4/3gxp(-Ari/6^)] and O \N^^^~'^e^p{-N^^-^/2 

with possibles choices of 6 in the range ]l/2, 2/3[. Taking 
5 in the range ]7/12, 2/3[ ensures that both corrections are 
negligible w.r.t. all terms of eq. H123|) . 

The result ()123ll is compared to empirical success prob- 
abilities of the greedy UC=R-|-UP algorithm on a large 
number (2000 to 7 x 10^) of instances of the random 3- 
SAT problem with sizes up to = 20000 in Figure COl In 
spite of strong finite-size effects (in 1/A^^/^), there is an 
excellent agreement because eq. ()123ll provides also the 
first subdominant term. 



0.5 



R N = 20000 
A R N=1000 
■ GUC N=20000 
□ GUC N=1000 




X 



Fig. 10. Empirical data for the probability of success of 
the greedy algorithms with R and UC heuristics on 10^ to 
7 X 10"" instances of the random 3-SAT problem, for several 
initial clauses-per-variable ratios. Error bars are smaller than 
the symbol's sizes. Dotted lines are guides for the eye. Data 
are plotted after rescaling their x and y axis with coefficients 
r* and to compare them with universal scaling function <1> 
(solid line), according to eq. H126|l . 
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6.1.3 Universality 

Any heuristic H run on a set of random instances with self- 
averaging C2 and a typical C2 such that, for a given initial 
constraint-per-variable ratio a, the resolution trajectory 
becomes tangent to the c?2 = 1 line at a finite time t* e 
]0, 1[ with 

l-d2it-t*)(x{t-t*)^ (125) 

has the same critical behaviour as R. Indeed, in such a 
case, the generating function for Ci satisfies eq. H34|l and 
one may use critical scalings for the quantities Ci, T, a 
and Psuccess to derive eq. (|102|l from eq. In these 

scalings, the exponents are independent of H because the 
geometric situation expressed by eq. (|125|l is the same as 
for heuristic R. Solving eq. H102() yields the same scaling 
function (p as for the R heuristic, i.e. there exists numbers 
an, rfj and (that depend on H and p) such that 

- lnP,ueccss[aH(l + ec.N-^'^)] = 

r% (r^° e„) + Eh + (n-^''') . (126) 



Eh is a non-universal correction (even the contribution 
from a primitive of the universal term ln|i — t*\/A in 
eq. H120|l to Eh is not universal because t* depends on H) . 
Scaling relation H126|l is expected to hold for most, if not 
all, algorithms using UP on random 2 -I- p-SAT instances 
with p > 2/5. For GUC we performed analytic computa- 
tions on the basis of eq. ^'M^ . The values of the numbers 
above are, in the case of p — 1, i.e. random 3-SAT, with 
GUC heuristic: 

acuc ~ 3.003494331 

is such that 3aGuc/2 — ln(3Q:Guc/2) = 3 
rguc = a~u/c^aR(l)i/i225/6 « 1.764223038 
r^°uc = (3aGuc/4- l/2)a^Jj/?aR(l)i/62-i/3 

« 1.363750542 
Pguc(1) ~ -1.2438849. 

Empirical data for the probability of success of the UP+ 
GUC algorithm are compared with the universal function 
<P in Figure irni — as for heuristic R, the agreement is very 
good, despite strong finite-size effects. 

Notice also that the point where the critical resolution 
trajectory gets tangent to the d2 = 1 line is universal. At 
this point, the residual 2-clauses-per-unassigned- variables 
ratio d2 = a{l — p) = 1 and the residual 3-clauses-per- 
unassigned- variables ratio ap = 3/2 so that each affecta- 
tion of variable through UP produces, in average, a new 
2-clause from the remaining 3-clauses — this is why d2 {t) 
has a vanishing derivative and the trajectory does not 
cross the ^2 = 1 line. Moreover, the resolution trajectory 
(e.g. its curvature) is locally the same for all heuristics 
since almost all time steps use UP; the chances that the 
heuristic rule is used in one step during the critical regime 
scale like P(Ci = 0) ~ iV-i/^. Therefore, improving some 

For heuristic R, — 2^k{p) and — k{p). 



heuristic may only affect the pre- and post-critical time 
regimes. A good heuristic is one that does its best to avoid 
the critical region, or to delay entering it as much as pos- 
sible. 



6.2 The special case of 2+2/5-SAT 

6.2.1 The tricritical point {p = 2/5, a = 5/3) 

For p — 2/5, the critical window for a is the same as for 2- 
and 3-SAT, a = 5/3(1 + eaN-^^^). The critical resolution 



trajectory is tangent to the ^2 = 1 line so that — In 



success 



scales like N^^^ like in the 3-SAT class. In addition, the 
delay of the actual distribution p of c = Ci/N^^^ w.r.t. 
the fully equilibrated distribution that solves eq. H69|l con- 
tributes to the success probability with a non-vanishing 
subdominant In N term. This is because, instead of re- 
versing its direction, the driving of c is directed towards 
during the whole algorithm's run, for the critical resolu- 
tion trajectory starts on the d2 ~ 1 line. 

For times T of the order of N^^^ , eq. (|102|l is relevant 
(with t* — and 62(^3) = <§ — £„). In the expression UllOfl . 

k{p) = 1 and tg^'' has to be 0. This yields 



-InPs, 



5., ^ 2' 

3(l + .«),5 



= ^'(e„)iVi/6 + o(iVi/^). 



(127) 

The critical distribution p of Ci/N^^^ is the same as for 
the 3-SAT family, up to scaling factors. 

As for 3-SAT, we did not compute directly the correc- 
tion due to secular evolution of p but we deduced its 
contribution to the final result by comparison between the 
time scales of h = T/N^/^ and of t = T/N. For fixed t 
and large N , eq. (|12U|I reads here 

- lnpno-contr(r ^ ^ tN) ^ N^'^'H^o) + ^ ^ " 

-f i ln< - .g(0) + O (iV-i/^) + S{tN^^^, N) (128) 
while eq. (|117|l reads 

- lnp„o-contr(T = tiV ^ iV) = 1 Q - 1^ - 1 Int 
-\ + \t+\e+0[N-'''). (129) 

Thus S{tN^/^, N) = ~t/A - t^/8 + .g(0), and 
-InP^ucccss = iVi/6<^(e„) + ln(iV)/24-5/8 + o(l) . (130) 

This expression compares well with numerical estimates 
in the critical ea — case, see Figure ITTI 

As a side remark, in the range of time steps of the 
order of N^^^, eq. (|77|l with vanishing P{p) is relevant. 

This would be possible by keeping a further order in the 
expansion of eq. 1341 and supplementing eq. 1691 with a PDE 
where p appears as a driving term. 
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Fig. 11. Comparison of empirical estimates of Psucccss with 
prediction eq. 111301 for — 0. We plot — [InPsuccess + 
^(0)]/ In(iV) as a function of 1/ ln(A''). The straight line is the 
analytical prediction, 1/24 — 5/8/ ln(A''). 



Numerical evidence shows that its solution p, once nor- 
malized, converges to the PDF p that satisfies eq. (|102|l 
with 62 = t* = 0, which is natural since p is the station- 
ary solution of eq. H77|) . This equilibration process takes a 
finite range of time T/N^^"^ , but a vanishing range of time 
^3: this is why the solution of eq. H1U2|) yields a finite value 
for c{t3 — 0), whereas Ci at T = is 0. 

For p = 2/5, the scaling function is truly universal, 
in the sense that r*^" = = 1 for all heuristics. Indeed, 
the resolution starts already in the critical regime where 
UP is used at almost every step and the heuristic becomes 
irrelevant; the trajectory d2{t) is then locally universal. 



for ea < and ep < 0. If we send Sp to —00 (as e.g. 
—N^/^q) with ea <C |ep, ^ behaves like 1/ep and Psucccss 
is finite. This was expected since, in this case, we dive into 
the success region below the d2 = I line. However, if we 
follow the c?2 = 1 hne and set = |ep, 'P behaves like 
-ln(ep)/ep and -InP^ucccss = ln(A^)[l/24- l/(20q)] to 
the leading order in N, which matches the singularity H51(l . 



6.3 Case of K-SAT with K > 4 

For general K, that is for random instances with initially 
N variables and C2 2-clauses, C3 3-clauses, Ck K- 
clauses, d2(t) may become tangent to the d2 — I line with 
an exponent greater than 2: 1 — (i2(t — t*) oc (i — t*)" with 
n < K, and the scaling exponent A for — In Psucccss may 
take the value 5(1 — This happens when reduction of 
j-clauses into j — 1-clauses compensates exactly the lost of 
i — 1-clauses for j > 4, so that d2{t) stays longer close to 
the critical d2 — \ Hne. n is necessarily integer because C2 
is computed after solving a triangular system of equations 
like l|27|l . When n is odd, the critical resolution trajectory 
crosses (with vanishing slope) the (^2 = 1 hne, coming from 
the failure region {d2 > 1) into the success one. Thus the 
critical behaviour may be reached only if the trajectory 
starts on this line (initial C2 = N] otherwise, the trajec- 
tory stays for a number of time steps < T < of the 
order of N in the failure region d2 > 1 and the probability 
of success vanishes exponentially) , and — In Psucccss is nec- 
essary accompanied with a In A'^ subleading term (because 
of the secular equilibration of the critical distribution of 
Ci, like in the 2 + 2/5-SAT case). 

The critical behaviour of the 2-SAT family is recovered 
if the initial clauses-per- variable ratios aj = Cj/N are 
low enough, with initial a2 = 1, so that the resolution 
trajectory is secant to the (i2 = 1 hne at time t = 0. The 
results for the R heuristic are universal (see Sect. I5.2.5|l 
and eq. (p?^ leads to 



cx N' 



(132) 



6.2.2 Matching the 2-SAT family with the 3-SAT class 

The refined scalingp = 2/5{l+epN^^^^) allows us to blow 
up the transition between the 2-SAT family where Psucccss 
decays algebraically with exponent ^{p) and the 3-SAT 
class where it decays as a stretched exponential. Now, the 
critical window for a is a = |(1 + |epiV^^/^ + e^iV"^/^), 

62(^3) = (^3 - fep)^ - jCp - Crv and 



- lnPs„cccss = Af^/'<Z>(ea, ep) + ln(iV)/24 + 0(1) 



with 

^(ea,ep) 



+00 



dy 



9 ""pi 



ie2 



Ai'(y)/Ai(2/) 



Ai^(y)^ 
Ai(y)2 



(131) 



provided a2 = 1, q;3/q;2 < 2/3, a4/a3 < 1, a^/a^ < 6/5, 
aK/aK-i < 2{K-2)/K. 

As for 2-|-p-SAT, the critical initial clauses-per-variable 
ratios that yield a stretched-exponential behaviour are not 
universal, but the position where the resolution trajec- 
tory meets the fi2 = 1 hyperplane (with exponent n) in 
the space of the a^ 's is. It lies on the boundary of the 
region that inequalities above define: if n = 2, 012 = 1, 
as — 2/3 and 014 < 2/3. If n = 3, 04 = 2/3 additionally 
but as < 4/5, and so on. The trajectory can't reach tan- 
gentially the boundary twice (because, in the neighbour- 
hood of the points where the boundary is hit tangentially, 
the flow is going away; coming for a second time to one 
of these points would need to re-increase some clauses- 
per-variable ratios, which is impossible precisely because 
of the inequalities above) ; thus — In Pguccess can have at 
most one power-law divergence (i.e. only one power of N) 
and one In divergence (always present when n is odd). 
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In the situation where the greatest power of N that 
appears in — InPguccoss is (1 — l/n)/3, the technique of 
computation based on eq. (|26|l may still be applied, pro- 
vided that 6 e]2/3 - l/(6n),2/3[. The shrinking of this 
interval as n — > -l-oo probably means that large deviations 
of C2 for finite size N have more and more influence (on 
the statistics of Ci and on Psucccss) as n is increased. 



7 Discussion and perspectives 

7.1 How well does UP estimate the ground state 
energy of 2 + p-SAT formulas? 

At the critical initial constraint-per- variable ratio a, the 
probability Psuccess that an algorithm using the UP rule 
finds a satisfying assignment to a random formula van- 
ishes algebraically with large A^. In contrast, for 2-SAT 
and a = 1, such a satisfying assignment exists with fi- 
nite probability (17) . that we numerically estimated to 
0.907 ± 10^^ - see Appendix ^ To this respect, even 
though the dynamic threshold of algorithms using the UP 
rule is equal to the static threshold acip) forp < 2/5, they 
perform not very well at the threshold (at least for p = 0). 
But we argue here that they don't overestimate too much 
the minimum number of clauses that can be satisfied si- 
multaneously in an instance of SAT. This number is also 
commonly referred to as the ground state energy of the 
instance. Finding it amounts to solve the so-called MAX- 
SAT optimization problem, and, like the SAT problem 
this problem is classified as difficult by the computer sci- 
entists. It is known to grow like (a — an)^ for a > or [77] . 

If we allow the greedy algorithm to keep running even 
if 0-clauses are found, the final number Co of 0-clauses is 
an upper bound to the ground state energy. The average 
Co may be computed from the generating function Goi 
that satisfies eq. H34|l with Xq = 1. Using non-zero values 
of Xo doesn't modify substantially the computations of 
Ci that we did so far. Eq. l(77|l reads now 



dt^p{c,t2) = -dlp{c,t2)+P{p)t2dcp{c,t2)- 
and eq. (p^ becomes 

Goi(^ tN^''';X^,X^^\) = 



l-Xr 



Cp{c,t2) 

(133) 



(134) 



InN 



where H depends also on Xq. Deriving eq. (|134|l w.r.t. Xq 

and setting Xq = \ yields 



(135) 



^lnA^ + 9x„i/[Xo,e„,/?(p)]|^^^^^ +0 . 

We know that, if Xq = 0, H[XQ,ea, I3{p)] behaves like ej^ 
for large positive Eq,. Assuming that the behaviour is the 



same for general Xq^ the average number of 0-clauses at 
the end of the critical time regime is 



(Co)(-> tN^/^) « h{p)[a-a^{p)fN- 



1 



12/3 (p) 



\tlN + 0{1) 
(136) 

for some universal number h{p) (as in Section 15.2.51 in 
the critical regime the heuristic is almost never used, only 
UP is used, hence h[p) is universal, or, in other words, in- 
dependent of the heuristic). The subsequent non-critical 
time regime N^/^ < N brings only a finite contribu- 
tion to (Co). Therefore, algorithms using UP give an upper 
bound to the ground state energy of 2 -I- p-SAT problems 
with p < 2/5 that grows with a like h{p){a — a-p.)^ . This 
speculation agrees well with numerical results for the aver- 
age Co at the end of runs of UC=UP-|-R or UP-I-HL on in- 
stances of 2-SAT, see Figure [HI The scahng (Co) cx iV^/i^ 
is also numerically correct (not shown on the figure). 
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Fig. 12. Numerical test of the scaling relationship l)13()^ for 
the UC=UP-)-R and UP-I-HL algorithms on 2-SAT (data are 
for heuristic R when not otherwise stated). The rescaled aver- 
age number of 0-clauses {{Co} /N)^''^ is plotted vs. the initial 
clauses-per- variable ratio a. The error bars are smaller than 
the symbol's sizes. The solid line is a tentative fit with slope 
0.65 « 0.27^/^ For a = 1, finite -size effects are strong since 
(Co) decreases with A'' as A''"^''^^, which is consistent with 
eq. 11361 . Using the HL heuristic rather than the fully random 
R heuristic gives some improvement for finite sizes, but no im- 
provement in the thermodynamic limit because the heuristic 
plays no role in the critical regime. 

The exponent 3 in h{p){a~ a^)^ agrees with the (non- 
rigorous) results for the ground state energy of random 
2-SAT computed in Section VI of However, the nu- 
merical estimate h{0) « 0.27 (see Fig. I12|l is clearly dif- 
ferent from the coefficient 0.15 computed in Section VI 
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of the bound that UP provides is not tight As a 
side remark, general rigorous results '28'"29' show that, us- 
ing techniques quite different from UP, one can design an 
algorithm running in polynomial time that finds, for any 
instance of random 2-SAT, an assignment which satisfies 
at least 93.1% of the number of clauses that can be satis- 
fied simultaneously in that instance (let us call this num- 
ber optimum of the instance; it is the difference between 
the number of clauses and the ground state energy). Con- 
versely, it is proved impossible to find a polynomial time- 
algorithm that would, for each instance, satisfy simulta- 
neously a number of clauses of at least 21/22 « 95.5% of 
the optimum (for 3-SAT, these two bounds become 7/8). 
For 2-SAT, our results state that, in average, the poly- 
nomial time greedy algorithms that use UP outperform 
these worst-case bounds for a close to an, (they satisfy in 
average a number of clauses that tends to the optimum as 
a tends to or , although on some rare instances they may 
perform badly). It might be interesting to know the aver- 
age behaviour of the algorithms of 2S 291 and see whether 
they get closer to the ground state energy as algorithms 
based on UP. 



7.2 Interpretation as a random graph percolation 
phenomenon 

Introduction of the oriented graph G representing 1- and 
2-clauses allows us to re-interpret some of the scalings that 
we found in the previous sections. G is made of 2 {N — T) 
vertices (one for each variable Xt and its negation Xi), 
Ci marked vertices (one for each 1-clause Zi), and 2C2 
edges {zi V Zj is represented by two oriented edges Zi Zj 
and Zj —> Zi) [Tlll7| . d2 is simply the average (ingoing or 
outgoing) degree of vertices in G- 

A step of UP removes a marked vertex, say z, and 
its attached outgoing edges, after having marked its de- 
scendants (2-clauses with z are reduced). If z appears in 
some 3-clauses, these clauses become 2-clauses, i.e. new 
pairs of edges in G- UP-steps are repeated until no vertex 
is marked, then some vertex is marked according to the 
heuristic of the algorithm and another round of UP starts. 
During a round, there is a competition between the elimi- 
nation of edges and vertices and the creation of new edges. 
All vertices in the (outgoing-edges) connected component 
of the initial G that started with the first marked ver- 
tex are removed. In addition, the vertices that have been 
linked to this component through the new edges are also 
removed. A contradiction arises (a 0-clause appears) when 
two conjugate vertices z and z are marked. When G per- 
colates (this happens for ^2 > 1 |^. 17 ) , there exist many 
oriented loops going from one literal z to its conjugate 
and back to it ^ . Marking a vertex of such a loop results 
sooner or later in the marking of both z and z, and the 
algorithm fails. Conversely, if G doesn't percolate, there 
is no such loop with finite probability. The success / failure 
transition of the algorithms using UP is related to the 

^® It might be that the exact ratio of h{p) to the coefficient 
of the ground state energy is 2. 



percolation transition of Cy; if the value of c?2 is bounded 
away from 1, G never percolates and the probability of 
success if finite. Although the results for percolation of 
random graphs were obtained in a static context, they ap- 
ply readily to the graph G that is kinetically built, because 
of conditional uniformity (see Sect. I3.l|l . 

Notice that the percolation phenomenon of random 
graphs is very robust. For instance, taking random graphs 
conditioned to a certain degree distribution of the ver- 
tices '31, does not change the universality class if the prob- 
ability of the large degrees decays not too slowly. Such 
(directed) graphs G appear in the context of algorithms 
with the HL or CL heuristics that select literals accord- 
ing to their degrees. We have checked numerically for HL 
that the degree distribution in G is far from the Poisson 
distribution that we find for non-conditioned C/'s (e.g. for 
R and GUC). Still, the probabilities of finding vertices of 
high degrees decrease fast and G has the same percolation 
critical behaviour as a random graph. A parallel can be 
drawn between the robustness of the critical behaviour of 
random graphs and, in our computation, the weak depen- 
dence of the probability law for Ci on that of C2 (only the 
average of C2/N matters thanks to self- averaging). 

In the percolation critical window 1^2 — 1| ~ A^~^/^ jSDl 
[Tt] . the probability that a vertex belongs to a component 
of size S is Q{S) ~ 5-3/2 ^^^^ cut-off equal to the 
largest size, 

7V2/3^ Prom Fi gure|21 departure ratios a have 
to differ from ckfiat by N^^"^ for resolution trajectories to 
fall into the critical window. Hence the critical window in 
a has width N-^/^ for both the 2-SAT family and the 
3-SAT class. 

When the resolution trajectory is tangent to the c?2 = 1 
line (3-SAT class), it spends the time At ^ \/\d2 — 1| ^ 
7V^i/s in the critical window, corresponding to AT — 
NAt^ N^/^ eliminated variables. Let 81,82, ■■ -,3 j be 
the sizes of components eliminated by UP in the criti- 
cal window; we have J - AT / d8 Q{8) 8) - N^'"^ . 
During the elimination, the number of marked ver- 
tices 'freely' diffuses, and reaches Ci ^ \fSj- The prob- 
ability that no contradiction occurs is [(1 — q)'~^^]^^ ^ 

exp(— S'^^^/iV) where g ~ is the probability that a 
marked vertex is the negation of the one eliminated by 
UP. Thus - InPsucccss ^ Jjd8Q{8) 8^/yN ^ N^/\ giv- 
ing A = ^. Notice that, while the average component size 
is S" ~ N^^^ (and thus the probability that Ci vanishes 
in the critical time regime is ~ N'^^/'^, consistently with 
Eq. IHEl), the value of A is due to the largest components 
with 8 ^ N'^/'^ i.e. Ci ~ N^/^ marked vertices. 

In the case of the 2-SAT family, the algorithm elim- 
inates only AT ~ iV^/^ variables and J ~ N^^^ con- 
nected components. The estimation above yields a finite 
— In Psuccess and fails to predict the correct answer because 
it doesn't take the In TV corrections into account. 

For general iiT-SAT, one may attach to an instance a 
family of oriented hypergraphs were vertices are the lit- 
erals and the ^-clauses are ^-hyperedges, for 2 < I < L 
and some fixed integer L between 2 and K. The first 
graph of this family, ior L = 2, is G- The critical be- 
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haviours exhibited in Section 16.31 appear when the first 
hypergraphs, 2 < Z < L for some i < L < K, per- 
colate. This happens ^| for clauses-per-variable ratios 
ai = 2'-y[l{l - 1)] for 2 < I < L; then - InP^^ccoss - 
withA = i(l-^). 

7.3 Unit-Propagation in other problems 

The Unit-Propagation rule that we defined for SAT prob- 
lems may easily be generalized to other contexts. In the 
graph coloring problem, one wants to know if the vertices 
of a given graph may be colored with K colors in such a 
way that no two vertices that share an edge have the same 
color. A family of algorithms that deal with such graphs 
processes the graph by maintaining the list of available 
colors on each vertex (initially the list of all K colors) 
I34| . When a vertex is colored with some color, this color 
is removed from the list of available colors of the neigh- 
bours of this vertex. The UP rule would be defined as: 
"When, on the partially colored graph, at least one ver- 
tex has only one available color, color it with this color 
prior to any other action" . We expect eq. H34|) to hold 
also for such graph coloring algorithms, after choosing a 
relevant value for C2. This value may be computed by the 
technique of differential equations PSIIM] like in eqs. (|77|l . 
The probability of success of the randomized, greedy col- 
oring algorithms should behave like Psuccoss for X-SAT; in 
particular, for the case of 3 colors, it should have a success- 
to-failure transition with critical behaviour in N^^^. 

Similarly, the study of greedy algorithms using UP for 
other hard decision computational problems |35j should 
lead to the same critical behaviour (that is, problems where 
the answer of the algorithm should be either "yes - satisfi- 
able - colorable - ..." or "no - unsatisfiable - uncolorable 
-..."). 

In the framework of error-correcting codes, the authors 
of reference jBBj also found a critical behaviour that is 
governed by a Brownian motion with parabolic drift 24 , 
which leads to Airy distributions, as we found for Ci in 
Section El 

There is another important family of problems, called 
optimization problems, where the answer is a number. For 
example, in the MAX-SAT problem, believed to be a hard 
computational problem, one wants to know, for some in- 
stance of SAT, the maximal number of satisfied clauses 
or, equivalently, the ground state energy, i.e. the minimal 
number of violated clauses over all assignments of vari- 
ables of this instance. In the context of graph coloring, 
one may look for the minimal number of colors needed 
to color some graph, etc. Knowing whether their random 
formulation (where instances are drawn uniformly at ran- 
dom) fall into the same universality classes as the random 
SAT problem requires more investigation. However, each 
optimization problem has a decision version, which can be 
related to this work. For instance, the decision version of 
random MAX-2+p-SAT would be: "is there an assignment 
of the variables of this instance such that the number of 
violated clauses does not exceed some fixed number m?" 
We can deal with it by asking for the probability that the 



greedy algorithm using UP ends, after having assigned all 
variables, with a number Co of 0-clauses that does not 
exceed m. As long as m is bounded in the large N limit, 
this new success probability can be computed by summing 
the 0th, 1st, 2nd, mth derivatives of 001(^07 1; ^^1^2) 
with respect to Xo in Xq = 0. A dynamic phase tran- 
sition between success and failure phases is found as for 
the random 2-|-p-SAT problem and, both for p < 2/5 and 
p> 2/5, the largest term in the expression for In -Psucccss 
at criticality is independent of m: the critical exponents 
are unchanged, although subdominant terms now depend 
on m (in particular, for p < 2/5, terms in mln(ln(A'^)) 
appear in the expression of — In Psuccoss at criticality) . We 
expect the exponents to remain the same for algorithms 
using UP on the decision version of other optimization 
problems. It would be interesting to know what happens 
when m is allowed to become large with N. 

Another interesting question is whether instances of 
SAT (or other problems) with strong correlations, such 
that self-averaging of C2 is lacking, would lead to new 
universality classes. This could be the case for instances of 
SAT such that G can be embedded into a finite-dimensional 
space, or with a power-law distribution of the number of 
occurrences of variables in clauses, possibly built by pref- 
erential attachment of new variables to highly-connected 
old variables. Graphs with such features are observed in 
real-world applications such as the 'World Wide Web' or 
social networks; they might be helpful in modeling indus- 
trial instances of SAT better than the random, 'flat' dis- 
tribution. 
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Appendix A: Numerical estimate of the 
probability of satisfiability for critical 2-SAT 

Reference ^Z) rigorously proves that the probability of a 
random 2-SAT formula of size N with initial constraint- 
per-variable a = 1 is finite (whereas it vanishes for large 
iV if a > 1). A numerical estimate of this probability was 
given on figure 5 of reference ^D, but for very small sizes 
N (up to 90). We give here more precise results, based on 
numerical estimates of the probability of satisfiability for 
formulas of sizes N = 500 to 5 x 10^. For each size N up to 
10®, we drew at random more than 3 x 10® instances (for 
= 5 X 10®, we drew only 10^) and we determined if they 
were satisfiable or not thanks to the well-known algorithm 



C. Deroulers and R. Monasson: Criticality and Universality in the Unit-Propagation Search Rule. 



27 



of reference Q]. This algorithm finds, by depth- first explo- 
ration, the strongly connected components of an oriented 
graph built from the 2-clauses, similar to the graph we in- 
troduced in Section Results from the simulations are 
plotted in Figure [T^ They appear to be fully consistent 
with the finite-size scaling hypothesis 



and yield 



0.907 ± fO" 
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Fig. 13. Numerical estimates of the probability of satisfia- 
bility of critical (a = 1) random 2-SAT formulas for different 
sizes A'', plotted vs. N~^^^. The straight line is a guess for the 
tangent to the curve at the point where it cuts the N — oo 
axis. 



Appendix B: Path-integral formalism for the 
kinetics of search 

The standard, yet non-rigorous, technique of path inte- 
grals is another tool to study the search process. Let us 
re-derive some results of Sections IS! and I^TI with it; this 
will show the correspondence between, on one hand, K- 
SAT related quantities and physical quantities such as 
moments, and, on the other hand, generating functions 
techniques and path-integral techniques, hopefully bring- 
ing more insights to both approaches. 

Wc start from the evolution equation (UJ and write 
the probability P{T) that the search process doesn't pro- 
duce any 0-clause from times to T as an iteration (this 



quantity was G{0, f , f , . . . , f ; T) in Section 13)1: 



T-l 



B'j' B'j' — i B\ 

Bt-2;T^2) X 



- E E 

Ci , . . . .Ot^ _ 1 Bi,....£ 
/ T 



" dyi 



X Af(Bi^Co;0) 



{2^y< ■ ■ ■ (2^) 



K 



(B.l) 



T-l 



exp iyL-iBL Cl)]1[ M{Bl+i ^ Cl; L) 



L=0 



with Co := (0, 0, ... , 0, aN) and = — f . All clause vec- 
tors are, in this Appendix, of dimension K instead of K-\-l 
since the number of 0-clauses is always zero. The y^'s con- 
strain the Ci's to mimic the B^s, so that we can write 
the formally quadratic expression ^ Bl;L) as 

the uncoupled expression M[Bl+i ^ Cl',L)- Carrying 
out the sums over the S^'s first, we obtain 



P{T) :=^P(CT;r) 



(B.2) 



E 



. (2^) 



K 



(27r)^ 



exp 



E m-CL ) 

L=l ) 



xexp|^^ln[/(Xi;L)].CL^ 

T r 

where (X/^), — exp{i{yL)i), the vector-valued function 
/ = (/ii /2, • ■ • , fx) is defined through ((T^ and the dot 
denotes the usual scalar product of M^. In the large N 
limit, a continuous formulation for P{T) can be obtained. 
Let us define the reduced time, £ ~ L/N and the clause 
densities c{£) = C{L)/N. Define for i I, . . . , X 



(B.3) 



with ?/o := — cxD, and 7 := (71, . . . , ^k)- The probability P 
at time T = tN can be written as a path integral over the 
values of clause densities c and (from now on complex) 
'momenta' y between times and i, 

(■c(t) = (ci,C2,...,CK) 

P{tN) = / Vc{l) Vy{l) X 

Jc{0) = {a,0,...fi,a) 

exp(^-NS[{c{£),y{e)}]^ (B.4) 
where the action reads 

s[{cii),yii)}] = iyif^y{i).^^ie)-'r{yiey,e).c{e) 

-In [p,{e) e-^^W + f-pi(^)]l(B.5) 
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where pi{£) denotes the probabiUty that there is at least 
one unit clause at time i i.e. the number of instants L 
such that Ci (L) > 1 between L = iN and L = iN + 
AL, divided by AL, with 1 <^ AL ^ L. Minimization of 
the action IjB.Sp yields the classical equations of motion. 
Differentiating with respect to momenta, we find 



c — boundary) constrain the PDF p to be concentrated 
around the c — boundary: p(c, will be non-vanishing 
only for values of c that tend to as i ^ -l-oo. With this 
choice of scale, eq. (ICljl turns at the leading order in s 
when s into 



SS 



da 



Piii) 



dlg{u, s) + 2pdug{u, s) = 
9„5(0, s) + 2/?g(0, s) = 



(C.2) 
(C.3) 



p^{l) + {l- p,{i))ev- 



(B.6) 



for i — 1, . . . , K . Some care has to be brought to the mini- 
mization of the action with respect to the clause densities 
since pi and ci are not independent. During the time in- 
terval [£,i + d£], the number Ci of unit clauses is either 
of the order of (ci > and pi = 1) or of the order of 
unity (ci = and pi < 1) With this caveat, we obtain 



^=„^*^(fl = -,.(,m;f) (, = 2 K) 



hence g(u^s) — A(s) exp(— 2/3u): c has at large times an 
exponential statistical distribution, with average (condi- 
tioned to success of the greedy algorithm) 0(^2) = 1/(2/3*2)- 
To actually compute the normalization factor A{s), we 
need to introduce formally the correction to this leading- 
order term. Let p{c,t2) —■ A{s) exp{—2f3u) + B{s)h{u,s) 
with the requirement B{s) ^ A{s) as s ^ 0. The leading- 
order terms in eq. IClj) (after cancellation of the formerly 
leading-order terms) satisfy 

[~2/3suA{s) - s'^dsA + us{l - Xo)A{s)/2] cxp(-2/3u) 



'-B{s) {dlh + (3duh) 



(C.4) 



and 



(B.7) 



with the boundary condition duh{0, s) + 2/3/i(0, s) = 0. 
This yields, after integration from u = to -l-oo (we as- 
sume that h{u,s) — > when u — > +00 just like p does). 



if ci{£) > 0, then pi{£) = 1 and 



A{s) cx s ^^^^ 
and, for the leading order of p(c, ^2) at large times: 



(C.5) 



Scii£) 



d£ 



if ci{£) ~ 0, then pi{£) is given by equation ljB.6|l for 
i = 1, 



p(c,t2) (X t2 



exp(~2/3ci2). 



(C.6) 



This shows that the probability of success of the algorithm 
decays algebraically at large times 12'- 



Eq. (|B.9p makes sense if the r.h.s. is larger than unity. 
In the case where yi = 0, it agrees with eq. (001 • 



Appendix C: Large t2-expansion of the 
solution p of the PDE d77|i 

In this appendix we explain the method we used to get an 
asymptotic expansion of the solution of eq. l(77|) at large 
times t2, and we quote the results. Eq. ifTzjl is here treated 
in the general case Xq £ [0, 1] and reads 

dt^p{c,t2) = ■^d'^p{c,t2)+l3{p)t2dcp{c,t2) ^cp{c,t2) 

(C.l) 

First, we change the variables c and t for u := ct2 and 
s := 1/^2 respectively, and define g as g{u,s) := p(c, t2). 
This is motivated by the expectation that, at large times 
^2, both the drift term (that pushes diffusing particles to- 
wards the c — boundary) and the absorption term (that 
kills situations where many particles are away from the 

This statement is true outside the critical regime where Ci 
and pi are both vanishing as (negative) powers of A^. 



Subleading orders in the expansion of p at large t2 may 
be found iteratively by the same technique. For the sake 
of completeness, let us quote here what we found: 



p(c,i2)oc4-(^-^")/^/''e-2'3ct. ^ 
l-Xo\ f 2,2 6/3 



(C.8) 



1 - 



4/3 



l+Xr 



12/33 



t^3 + 



4/3 



[(6/3^X0 + 24/3* - 6(3^)chl+ 



48/34 

(48/3^ -I- 8/32^0 - SP^)ct2+ 
(-2/3 -1- 2X0/3 + 2X0 + 48/3^ - 1 - X^)] 
6/3 - 1 + Xq 
1152/37 ^ 

(384/3^ - 50/3 + 50X0/3 - 1 + 2X0 - X^)^-^ + o(r^) 

and, for the conditional average of c: 
1 , 4/3-(l-Xo) 



2/3*2 



8/34*4 



32/37* 



J [4/3 - (1 - Xo)] [5/3 -{1-Xo 



(C.9) 

+ o(/3-V)- 
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Notice that the expansions here can only be asymp- 
totic series, without a finite radius of convergence (as a 
function of s = 1/^2), because they are independent of 
the initial conditions. Indeed, fix e > and suppose that 
the sum of the expansion for p above is the right solution, 
up to a uniform small difference e, on a finite interval of ^2, 
say ]A, -|-oo[. Let Papprox(c, i2) be this sum of the expan- 
sion. Then ask for the solution of eq. H77|) that has initial 
condition p{c, A+\) — /Oapprox(c, ^-|-l) + 10e: it should also 
be Papprox(c, ^ + 1) up to a difference e — contradiction! 
Therefore, the precision of the approximations obtained 
with partial sums of the expansion above decreases as we 
try to get values for smaller and smaller times ^2, and we 
should keep that in mind when we use the present results. 



Appendix D: Small t2-6xpansion of the 
solution p of the PDE (j77|i 

In this Appendix we explain the method we used to get an 
asymptotic expansion of the solution of eq. (|C.1|I at small 
times t2, and we quote the results. 

We use the exact solution of eq. (jC.ip without the non- 
linear term cp(c, (that is, for the special value = 1) 
as a guide for a relevant choice of variables. Disregard- 
ing the boundary condition eq. H79|) . we easily solve the 
resulting linear PDE thanks to Fourier transform: 



P\h- 



r(c,<2) = 



constant 



1 



^2ii 



(D.l) 



exp ■ 



2(^2 — ^2init) 



''2init^ 



which suggests us to use the following variables for our 
expansion: 



V := to 



''2init, 



c + /3(p)/2(i2_t2.^.J 



in it 



p(c,t2) =: —^g{u,v) . 

The equations for g are 

2vd,g = dig + udug + (D.2) 
1 - (1 - Xoy/^u + (1 - Xo)/3(p)/2t;2(2t2i„it + v) 

dug[P{p) /2y^{2t2init + v),v] = (D.3) 
-2/3(p)\A;(i2init + v)g[(3ip)/2V^{2t2init + v),v] 



At t2 = t2init, i-C. at w = 0, g is Gaussian, as expected from 
the study of the linear equation (with Xq = 1): g{u, 0) = 
V2tt~^ exp(— u^/2). Letting v''h{u, v) be the deviation, at 
positive times v, between the exact g and this Gaussian 
expression, and substituting this into the equations for 
we see that the boundary condition constrains 7 to the 
value 1/2, and we are led to the ODE for h{u, 0) 



dlh{u,0) + uduh{u,0) = 



(D.4) 



with boundary condition duh{0,0) — —2(3{p)t2init/V^- 
Adding the physical requirement that h(u, 0) — s- as w — > 
-l-oo, we find the unique solution h{u,0) = /3(p)t2init[l ~ 
erf('u/-\/2)]. Going on with this iterative process, we find 
that p{c,v = t2 ~ i2init) has the following expansion at 
small v: 



p(c, v) = \ — exp 
" nv 



(c -I- (3vwY 



2v 



l + 2/324„it«(D.5) 

2i„it I + 0{v) + 
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2(3H 
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where w stands for (2i2init + v)/2. 
for the conditional average of c: 



+ 1 



Hence the expression 
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Appendix E: Direct numerical solution of the 
evolution equation for the generating 
function Gi 

For limited sizes N of the problem, eq. (|34|l may be solved 
directly, with exact or high precision computer arithmetic. 
It may be used to compute the probability of success of 
the greedy algorithm from time steps to T, Gi(l, T) or 
the distribution of Ci from the generating function Gi. 

We assume that Xq = and that C2 is known (from 
e.g. Eq. idnil)- At T = A^, Ci is either 1 or and a use- 
ful set of values of Xi in eq. is 0, 1 only. To com- 
pute Gi{Xi,N) with Xi = or 1, it is enough to know 
Gi{Xi,N ^ 1) with Xi = 0, 1/2 or 1 after eq. (O (we 
set Xq — 0). This in turns requires successively the values 
G(Xi,T) for all Xi e 0,1/[2{N - T)],1/{N - T), . . . ,1. 
Starting from the known initial condition G{Xi,0) = 1, 
it is possible to compute iteratively, from T = to A^, all 
values G{k/[2{N ~ T)],T) for < fc < 2{N - T) (at each 
step, only 2(A^ — T) -I- 1 numbers are stored). In practice, 
getting accurate results requires either doing exact arith- 
metics (which may be quite slow and/or memory consum- 
ing) or working with high precision floating point arith- 
metic (roughly speaking, the number of decimals digits 
has to be equal to A^). This limited the range of A^ to a 
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few hundreds on the computer we used (2.4GHz Pentium 
IV processor with 1GB of RAM). 

Using rounded, integer values for C2 rather than sim- 
ply N XC2 from eq. (|29|l makes computations much faster. 
In the case where exact arithmetic is used, it is needed to 
manage only rational numbers. For N large enough, it 
brings only a small error on Gi . 

If the number of 1-clauses at time T is bounded by 
2(iV - T), Ci is a polynomial of degree at most 2{N - T). 
Knowing its 2{N ~ T) + I values G{k/[2{N - T)],T) is 
enough to compute it: Gi may be expressed as a weighted 
sum of Lagrange interpolating polynomials. Finally, the 
coefficients of Gi{Xi) indicate the probability law of Ci. 
We used this technique to get the numerical results for 
the distribution of Gi for 2-SAT with N = 501 that are 
plotted in Figure ||SJ|. Contrary to estimates based on runs 
of the very algorithms under study, it doesn't require av- 
eraging over many series since there is no randomness in 
the computation. 



References 

1. B. Aspvall, M. F. Plass, and R. E. Tarjan. A linear-time al- 
gorithm for testing the truth of certain quantified boolean 
formulas. Inf. Process. Lett, 8:121-123, 1979. 

2. C. Papadimitriou and K. Steiglitz. Combinatorial Opti- 
mization. Dover, 1998. 

3. John Franco. Probabilistic analysis of the pure literal 
heuristic for the satisfiability problem. Annals of Oper- 
ations Research, 1:273-289, 1984. 

4. John Franco and R. P. Swaminathan. Average case re- 
sults for satisfiability algorithms under the random clause 
model. Annals of Mathematics and Artificial Intelligence, 
20:357-391, 1997. 

5. Ming-Te Chao and John Franco. Probabilistic analysis of a 
generalization of the unit-clause literal selection heuristics 
for the k-satisfiability problem. Inform. Sci., 51(3):289- 
314, 1990. 

6. Dimitris Achlioptas. Lower bounds for random 3-sat via 
differential equations. Theor. Comp. Sci., 265(1-2):159- 
185, 2001. 

7. Ming-Te Chao and John Franco. Probabilistic analysis of 
two heuristics for the 3-satisfiability problem. SIAM J. 
Comput., 15(4):1106-1118, 1986. 

8. Alan Frieze and Stephen Suen. Analysis of two simple 
heuristics on a random instance of k-sat. J. Algorithms, 
20(2):312-355, 1996. 

9. Christophe Deroulers and Remi Monasson. Critical 
behaviour of combinatorial search algorithms, and the 
unitary- propagation universality class. Europhys. Lett., 
68(1):153-159, October 2004, cond-mat/0405319. 

10. Remi Monasson, Riccardo Zecchina, Scott Kirkpatrick, 
Bart Selman, and L. Toyansky. Phase transition and search 
cost in the 2-|-p-sat problem. In ^ih Workshop on Physics 
and Computation, Boston University, November 1996. 

11. Scott Kirkpatrick and Bart Selman. Critical behavior in 
the satisfiability of random boolean expressions. Science, 
264:1297-1301, 1994. 

12. Bart Selman and Scott Kirkpatrick. Critical behavior in 
the computational cost of satisfiability testing. Artificial 
Intelligence, 81:273-295, 1996. 



13. Dimitris Achlioptas, Lefteris M. Kirousis, Evangelos 
Kranakis, and Danny Krizanc. Rigorous results for ran- 
dom (2 -f p)-sat. Theor. Comp. Sci., 265(1-2): 109-129, 
2001. 

14. Olivier Dubois, Yacine Boufkhad, and Jacques Man- 
dler. Typical random 3-sat formulae and the satisfiabil- 
ity threshold. In Proc. 11th ACM-SIAM Symposium on 
Discrete Algorithms, pages 126-127, 2000. 

15. Alexis C. Kaporis, Lefteris M. Kirousis, and Efthimios C. 
Lalas. The probability analysis of a greedy satisfiability 
algorithm. In Proc. 10th Ann. European Symp. on Algo- 
rithms, pages 574-585, 2002. 

16. David B. Wilson. On the critical exponents of random 
fc-sat. Random Structures and Algorithms, 21(2):182-195, 
2002, math.PR/0005136. 

17. Bela BoUobas, Christian Borgs, Jennifer T. Chayes, 
Jeong Han Kim, and David Bruce Wilson. The scaling 
window of the 2-SAT transition. Random Structures and 
Algorithms, 18(3):201-256, 2001. 

18. Haiou Shen and Hantao Zhang. An empirical study of 
max-2-sat phase transition. In Proc. of LICS'03 Work- 
shop on Typical Case Complexity and Phase Transitions. 
Ottawa, Canada, June 2003, 2003. 

19. M. Davis and H. Putnam. A computing procedure for 
quantification theory. J. ACM, 7:201-215, 1960. 

20. Gilles Dequen and Olivier Dubois, kcnfs: an efficient solver 
for random k-sat formulae. Lecture Notes in Computer Sci- 
ence 2919, Theory an Applications of Satisfiability Testing, 
pages 486-501, 2004. 

21. Dimitris Achlioptas and Gregory B. Sorkin. Optimal my- 
opic algorithms for random 3-SAT. In IEEE Symposium 
on Foundations of Computer Science, pages 590-600, 2000. 

22. Nicholas C. Wormald. Differential equations for random 
processes and random graphs. The Annals of Applied Prob- 
ability, 5(4): 1217-1235, November 1995. 

23. J. R. Albright. Integrals of products of airy functions. J. 
Phys. A - Math. Gen., 10:485-490, April 1977. 

24. Piet Groeneboom. Brownian motion with a parabolic drift 
and airy functions. Probability Theory and Related Fields, 
81:79-109, February 1989. 

25. Milton Abramowitz and Irene A. Stegun. Handbook 
of Mathematical Functions with Formulas, Graphs, and 
Mathematical Tables. Dover, New York, ninth dover print- 
ing, tenth gpo printing edition, 1964. 

26. Harris J. Silverstone, Sachiko Nakai, and Jonathan G. Har- 
ris. Observations on the summability of confluent hyperge- 
ometric functions and on semiclassical quantum mechan- 
ics. Phys. Rev. A, 32:1341-1345, September 1985. 

27. Remi Monasson and Riccardo Zecchina. Statistical me- 
chanics of the random k-sat model. Phys. Rev. E, 
56(2): 1357-1370, August 1997, cond-mat/9606215. 

28. Johan Hastad. Some optimal inapproximability results. In 
STOC '97: Proceedings of the twenty-ninth annual ACM 
symposium on Theory of computing, pages 1-10, New 
York, NY, USA, 1997. ACM Press. 

29. Johan Hastad. Some optimal inapproximability results. J. 
ACM, 48(4):798-859, July 2001. 

30. S. Janson, T. Luczak, and A. Rucinski. Random Graphs. 
Wiley, 2000. 

31. M. E. J. Newman, S. H. Strogatz, and D. J. Watts. Ran- 
dom graphs with arbitrary degree distribution and their 
applications. Phys. Rev. E, 64:026118-1-17, July 2001. 



C. Deroulers and R. Monasson: Criticality and Universality in the Unit-Propagation Search Rule. 



31 



32. Christina Goldschmidt. Critical random hypcrgraphs: 
the emergence of a giant set of identifiable ver- 
tices. Annals of Probability, 33(4):1573-1600, July 2005, 
math.PR/0401208. 

33. D. Achlioptas and M. MoUoy. The analysis of a list- 
coloring algorithm ou a random graph. In FOCS '97: Pro- 
ceedings of the 38th Annual Symposium on Foundations of 
Computer Science (FOCS '97), pages 204-212, Washing- 
ton, DC, USA, 1997. IEEE Computer Society. 

34. Liat Ein-Dor and Remi Monasson. The dynamics of prov- 
ing uncolourability of large random graphs, i. symmetric 
colouring heuristic. J. Phys. A - Math. Gen., 36:11055- 
11067, October 2003. 

35. M. R. Garey and D. S. Johnson. Computers and In- 
tractability: a guide to the theory of NP-completeness. W. 
H. Freeman and Co., New York, 1979. 

36. Abdclaziz Amraoui, Andrea Montanari, Thomas J. 
Richardson, and Riidiger L. Urbanke. Finite-length scal- 
ing for iteratively decoded Idpc ensembles. In Proc. 41th 
Annual Allerton Conference on Communication, Con- 
trol and Computing, Monticello, IL, Oct 2003, 2004, 
cs.IT/0406050. 



